Teacher Logits Collection Logits captured from large models to act as the teacher for distillation • 3 items • Updated 2 days ago • 7
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated about 18 hours ago • 76
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated about 18 hours ago • 33
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 15 days ago • 127
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 304
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated about 18 hours ago • 28
view article Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23 • 62
Medical LLMs Collection A collection of fine-tuned open-llms on biomedical and clinical datasets • 6 items • Updated Oct 14 • 4