KuraKura AI

community

kurakurai

Activity Feed

AI & ML interests

Sea Turtles

Recent Activity

MaxLSB authored a paper 4 days ago

Rethinking the Multilingual Reasoning Gap with Layer Swap

GAD-cell new activity 3 months ago

kurakurai/Luth-LFM2-350M-GGUF:adds Q4_KM for mobile compatible

GAD-cell new activity 3 months ago

kurakurai/Luth-LFM2-700M-GGUF:adds Q4 KM gguf file for mobile compatibility

View all activity

MaxLSB

authored a paper 4 days ago

Rethinking the Multilingual Reasoning Gap with Layer Swap

Paper • 2605.26735 • Published 7 days ago • 2

mlabonne

posted an update about 1 month ago

Post

2212

Big update to llm-datasets, my curated list of datasets and tools for post-training LLMs.

> Added many new datasets
> New "thinking" column
> Refreshed recommended tools.

Thanks to everyone who told me they used it for their research at ICLR, you motivated this update!

2 replies

GAD-cell

in kurakurai/Luth-LFM2-350M-GGUF 3 months ago

adds Q4_KM for mobile compatible

#1 opened 3 months ago by

Tonic

GAD-cell

in kurakurai/Luth-LFM2-700M-GGUF 3 months ago

adds Q4 KM gguf file for mobile compatibility

#1 opened 3 months ago by

Tonic

mlabonne

authored 2 papers 4 months ago

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 61

Zero-Overhead Introspection for Adaptive Test-Time Compute

Paper • 2512.01457 • Published Dec 1, 2025 • 3

mlabonne

posted an update 5 months ago

Post

10389

New family of 1B models just dropped!

> LiquidAI/LFM2.5-1.2B-Base: 10T → 28T tokens
> LiquidAI/LFM2.5-1.2B-Instruct: new large-scale multi-stage RL
> LiquidAI/LFM2.5-1.2B-JP: our most polite model
> LiquidAI/LFM2.5-VL-1.6B: multi-image multilingual
> LiquidAI/LFM2.5-Audio-1.5B: 8x times faster, no quality loss

Super proud of this release 🤗

3 replies

MaxLSB

authored a paper 8 months ago

Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer

Paper • 2510.05846 • Published Oct 7, 2025 • 3

GAD-cell

updated 2 datasets 8 months ago

kurakurai/scholar

Viewer • Updated Oct 12, 2025 • 60.6k • 253 • 8

kurakurai/luth-sft

Viewer • Updated Oct 12, 2025 • 571k • 782 • 15

GAD-cell

updated 5 models 8 months ago

authored a paper 8 months ago

Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer

Paper • 2510.05846 • Published Oct 7, 2025 • 3

mlabonne

posted an update 8 months ago

Post

8463

LiquidAI/LFM2-8B-A1B just dropped!

8.3B params with only 1.5B active/token 🚀

> Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B
> MoE designed to run on phones/laptops (llama.cpp / vLLM)
> Pre-trained on 12T tokens → strong math/code/IF

1 reply

mlabonne

posted an update 8 months ago

Post

3910

⚛️ New drop of tiny task-specific models!

Want to do data extraction, translation, RAG, tool use, or math on a Raspberry Pi? We got you covered! ✅

These tiny models were fine-tuned to perform narrow tasks extremely well, making them competitive with much larger models.

You can deploy them today on-device or even on GPUs for big data operations!

LiquidAI/liquid-nanos-68b98d898414dd94d4d5f99a

1 reply

MaxLSB

updated 2 models 8 months ago

kurakurai/Luth-LFM2-350M

Text Generation • 0.4B • Updated Oct 12, 2025 • 258 • 15

kurakurai/Luth-LFM2-700M

Text Generation • 0.7B • Updated Oct 12, 2025 • 141 • 16

AI & ML interests

Recent Activity

Team members 4

kurakurai's activity

adds Q4_KM for mobile compatible

adds Q4 KM gguf file for mobile compatibility