Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models Paper • 2506.01413 • Published Jun 2, 2025 • 17
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 117
Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts Paper • 2601.22156 • Published Jan 29 • 14
Zero-Mistral 24B Collection Zero-Mistral-24B is our latest Russian language adapted version of Mistral-Small-3.1 • 2 items • Updated Apr 24, 2025 • 1
LLM datasets Collection Комбинации из проверенных хороших и полезных датасетов для обучения LLM • 5 items • Updated Apr 18, 2025 • 1
Zero-Mistral-Small-24B-Instruct-2501 Collection Zero-Mistral-Small is an improved version of mistralai/Mistral-Small-24B-Instruct-2501, primarily adapted for Russian and English languages. • 6 items • Updated 10 days ago • 1