Multimodal Example - Search News

Solving multimodal medical device design challenges

The architecture of a multimodal system depends on the coordination of diverse hardware and software components into a single ...

Devdiscourse

How AI and blockchain are reinventing multimodal logistics for disruption-prone world

14d

Black Forest Labs' new Self-Flow technique makes training multimodal AI models 2.8x more efficient

This efficiency makes it viable for enterprises to move beyond generic off-the-shelf solutions and develop specialized models ...

14d

Microsoft open-sources multimodal reasoning model with 15B parameters

The company mainly trained Phi-4-reasoning-vision-15B on open-source data. The data included images and text-based descriptions of the objects depicted in those images. Before it started training the ...

17d

DeepSeek V4 Adds Blackwell SM100 and FP4 Support for Lower-Cost Scaling

DeepSeek V4 ships native multimodal input with lower latency, plus support for Blackwell SM100 and FP4 compute scaling.

GitHub

Multimodal: llava dataset energon prompt changed

The multimodal examples suggested class 10 for VQA, but the new llava dataset and energon prepare have updated the selections: class 10 is no longer VQA. Do you want to create a dataset.yaml interactively ...

IEEE

DataWink: Reusing and Adapting SVG-Based Visualization Examples with Large Multimodal Models

Abstract: Creating aesthetically pleasing data visualizations remains challenging for users without design expertise or familiarity with visualization tools. To address this gap, we present DataWink, ...

Microsoft

Magma: A Foundation Model for Multimodal AI Agents

We present Magma, a foundation model that serves multimodal AI agentic tasks in both the digital and physical worlds. Magma is a significant extension of vision-language (VL) models in that it not ...

The Robot Report

Ai2 says its Molmo 2 multimodal AI model can do more with less data

The Allen Institute for AI, also known as Ai2, last week released Molmo 2, its latest multimodal model suite capable of precise spatial and temporal understanding of video, image, and multi-image sets.

TMCnet

Ai2 Releases Molmo 2: State-of-the-Art Open Multimodal Family for Video and Multi-Image Understanding

Ai2 (The Allen Institute for AI) today announced Molmo 2, a state-of-the-art open multimodal model suite capable of precise spatial and temporal understanding of video, image, and multi-image sets.

Morningstar

Ai2 Releases Molmo 2: State-of-the-Art Open Multimodal Family for Video and Multi-Image Understanding

New open models unlock deep video comprehension with novel features like video tracking and multi-image reasoning, accelerating the science of AI into a new generation of multimodal intelligence.
