1 22 164

Snehal

spate141

https://snehal.ai/

spate141

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

unsloth/LTX-2.3-GGUF

upvoted a collection 25 days ago

DFlash

liked a model 28 days ago

Qwen/Qwen3.6-27B-FP8

View all activity

Organizations

None yet

upvoted a collection 25 days ago

DFlash

Collection

Block Diffusion for Flash Speculative Decoding • 21 items • Updated 11 days ago • 117

upvoted a collection 4 months ago

Qwen3-Coder-Next

Collection

4 items • Updated Feb 3 • 125

upvoted 2 articles 5 months ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq

•

Dec 18, 2025

• 124

Article

The Optimal Architecture for Small Language Models

codelion

•

Dec 26, 2025

• 121

upvoted a collection 5 months ago

Gemma 3n

Collection

4 items • Updated Mar 12 • 269

upvoted an article 6 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 389

upvoted an article 7 months ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 313

upvoted an article 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 776

upvoted an article 11 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

thomwolf, matthieu-lapeyre

•

Jul 9, 2025

• 800

upvoted 2 collections 12 months ago

LLaVa-NeXT

Collection

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 34

LLaVA-1.6

Collection

A collection of LLaVA-1.6 checkpoints • 4 items • Updated Jan 31, 2024 • 75

upvoted an article 12 months ago

Article

Uncensor any LLM with abliteration

mlabonne

•

Jun 13, 2024

• 855

upvoted an article about 1 year ago

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 208

upvoted a paper about 1 year ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 99

upvoted a collection about 1 year ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 70 items • Updated 28 days ago • 272

upvoted 3 articles over 1 year ago

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 209

Article

Open-source DeepResearch – Freeing our search agents

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

Article

They Said It Couldn’t Be Done

Pclanglais

•

Dec 5, 2024

• 91

upvoted a collection almost 2 years ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 711

upvoted an article almost 2 years ago

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 275

Snehal

AI & ML interests

Recent Activity

Organizations

spate141's activity

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

The Optimal Architecture for Small Language Models

Continuous batching from first principles

Supercharge your OCR Pipelines with Open Models

SmolLM3: smol, multilingual, long-context reasoner

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Uncensor any LLM with abliteration

🪆 Introduction to Matryoshka Embedding Models

Let's talk about LLM evaluation

Open-source DeepResearch – Freeing our search agents

They Said It Couldn’t Be Done

Training and Finetuning Embedding Models with Sentence Transformers