Rethinking the Multilingual Reasoning Gap with Layer Swap Paper • 2605.26735 • Published 7 days ago • 2
view post Post 2212 Big update to llm-datasets, my curated list of datasets and tools for post-training LLMs.> Added many new datasets> New "thinking" column> Refreshed recommended tools.Thanks to everyone who told me they used it for their research at ICLR, you motivated this update! See translation 2 replies · 👀 3 3 🤗 3 3 👍 3 3 + Reply
Zero-Overhead Introspection for Adaptive Test-Time Compute Paper • 2512.01457 • Published Dec 1, 2025 • 3
view post Post 10389 New family of 1B models just dropped!> LiquidAI/LFM2.5-1.2B-Base: 10T → 28T tokens> LiquidAI/LFM2.5-1.2B-Instruct: new large-scale multi-stage RL> LiquidAI/LFM2.5-1.2B-JP: our most polite model> LiquidAI/LFM2.5-VL-1.6B: multi-image multilingual> LiquidAI/LFM2.5-Audio-1.5B: 8x times faster, no quality lossSuper proud of this release 🤗 See translation 3 replies · 🚀 18 18 👀 1 1 + Reply
Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer Paper • 2510.05846 • Published Oct 7, 2025 • 3
Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer Paper • 2510.05846 • Published Oct 7, 2025 • 3
view post Post 8463 LiquidAI/LFM2-8B-A1B just dropped!8.3B params with only 1.5B active/token 🚀> Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B> MoE designed to run on phones/laptops (llama.cpp / vLLM)> Pre-trained on 12T tokens → strong math/code/IF See translation 1 reply · 🔥 9 9 🚀 3 3 + Reply
view post Post 3910 ⚛️ New drop of tiny task-specific models!Want to do data extraction, translation, RAG, tool use, or math on a Raspberry Pi? We got you covered! ✅These tiny models were fine-tuned to perform narrow tasks extremely well, making them competitive with much larger models.You can deploy them today on-device or even on GPUs for big data operations! LiquidAI/liquid-nanos-68b98d898414dd94d4d5f99a See translation 1 reply · 🔥 5 5 👍 2 2 😎 1 1 + Reply