Vision
updated
liuhaotian/llava-v1.6-34b
Image-Text-to-Text
• 35B • Updated • 32.4k
• 362
deepseek-ai/deepseek-vl-7b-base
7B • Updated • 87
• 65
deepseek-ai/deepseek-vl-7b-chat
Image-Text-to-Text
• 7B • Updated • 3.5k
• 270
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
• 8B • Updated • 123k
• 621
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
• 8B • Updated • 74
• 95
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text
• 8B • Updated • 1.21k
• 28
google/paligemma-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 636
• 124
microsoft/Phi-3-vision-128k-instruct
Text Generation
• Updated • 104k
• 970
Image-Text-to-Text
• 7B • Updated • 89.6k
• 200
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
• Updated • 1.47M
• 730
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
• 11B • Updated • 12.8k
• 586
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
• 11B • Updated • 177k
• 1.58k
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
• 89B • Updated • 2.61k
• 134
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
• 89B • Updated • 12.2k
• 355
meta-llama/Llama-Guard-3-11B-Vision
Image-Text-to-Text
• 11B • Updated • 2.03k
• 71
Image-Text-to-Text
• 73B • Updated • 5.31k
• 298
Image-Text-to-Text
• 8B • Updated • 20.2k
• 565
Image-Text-to-Text
• 8B • Updated • 1.29k
• 163
Image-Text-to-Text
• Updated • 5.56k
• 157
Text-to-Video
• Updated • 5.33k
• • 1.32k
Image-Text-to-Text
• Updated • 246
• 1.71k
Image-to-Video
• Updated • 549k
• • 2.16k