Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,432

Full-text search

Active filters: multimodal

ByteDance/Dolphin-v2

Image-Text-to-Text • 4B • Updated 5 days ago • 746 • 73

allenai/Molmo2-8B

Video-Text-to-Text • 9B • Updated 1 day ago • 27 • 34

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 287k • 766

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 12 days ago • 1.86k • 80

allenai/Molmo2-4B

Video-Text-to-Text • 5B • Updated 1 day ago • 8 • 14

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 6 days ago • 114k • 443

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 295k • 457

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 3.06M • • 1.39k

allenai/Molmo2-O-7B

Video-Text-to-Text • 8B • Updated 1 day ago • 9

allenai/Molmo2-VideoPoint-4B

Video-Text-to-Text • 5B • Updated about 22 hours ago • 9

OctoMed/OctoMed-7B

Image-Text-to-Text • 8B • Updated 11 days ago • 1.09k • 16

Cognitive-Lab/NetraEmbed

Visual Document Retrieval • 4B • Updated 7 days ago • 622 • 22

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22 • 61.1k • 235

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22 • 24.8k • 181

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 1.84M • 475

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.41M • • 1.25k

allenai/Molmo-7B-D-0924

Image-Text-to-Text • 8B • Updated 2 days ago • 38.1k • 557

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 6.05M • 575

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 107k • • 572

cpatonn/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit

Any-to-Any • 10B • Updated Sep 28 • 29.8k • 36

stepfun-ai/GELab-Zero-4B-preview

Image-to-Text • 4B • Updated 16 days ago • 1.16k • 95

ServiceNow-AI/Apriel-1.6-15b-Thinker-GGUF

14B • Updated about 18 hours ago • 3

thesby/Qwen3-VL-8B-NSFW-Caption-V4.5

Image-to-Text • 9B • Updated Nov 7 • 20.2k • 59

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 27.2k • 120

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 154k • 1.83k

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 69.3k • 109

TencentARC/ARC-Qwen-Video-7B-Narrator

Video-Text-to-Text • 9B • Updated Sep 21 • 67 • 9

IDEA-Research/Rex-Omni

Image-Text-to-Text • 4B • Updated Oct 16 • 22k • 50

bytedance-research/Vidi-7B

9B • Updated 1 day ago • 588 • 10

VITRA-VLA/VITRA-VLA-3B

Robotics • Updated 17 days ago • 2