MediaTek-Research/Breeze-ASR-25 Automatic Speech Recognition β’ 2B β’ Updated Jul 8, 2025 β’ 5.39k β’ 89
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models Paper β’ 2511.11007 β’ Published Nov 14, 2025 β’ 15
view article Article Weβre open-sourcing our text-to-image model and the process behind it Nov 12, 2025 β’ 76
nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition β’ Updated about 11 hours ago β’ 8.83k β’ 86
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper β’ 2508.16153 β’ Published Aug 22, 2025 β’ 160
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction β’ 7B β’ Updated Aug 19, 2025 β’ 12.6k β’ 198
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment Paper β’ 2507.20984 β’ Published Jul 28, 2025 β’ 57
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 β’ 745