Vision-Language Models Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 5.1M • • 1.58k microsoft/Florence-2-large Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 418k • 1.82k google/paligemma2-3b-pt-224 Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 26.7k • 173
OCR & Document AI nvidia/nemotron-ocr-v2 Image-to-Text • Updated 25 days ago • 9.96k • 205 deepseek-ai/DeepSeek-OCR Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 1.69M • 3.28k zai-org/GLM-OCR Image-Text-to-Text • 1B • Updated 28 days ago • 2.58M • • 1.83k
Vision-Language Models Qwen/Qwen2.5-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Apr 6, 2025 • 5.1M • • 1.58k microsoft/Florence-2-large Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 418k • 1.82k google/paligemma2-3b-pt-224 Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 26.7k • 173
OCR & Document AI nvidia/nemotron-ocr-v2 Image-to-Text • Updated 25 days ago • 9.96k • 205 deepseek-ai/DeepSeek-OCR Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 1.69M • 3.28k zai-org/GLM-OCR Image-Text-to-Text • 1B • Updated 28 days ago • 2.58M • • 1.83k