Aryan Agal

aryanagal

grubdragon

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

HuggingFaceFW/blogpost-fineweb-v1

liked a Space 9 days ago

nanotron/ultrascale-playbook

liked a Space 9 days ago

HuggingFaceTB/smol-training-playbook

Organizations

None yet

liked 3 Spaces 9 days ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.26k

Generate high-quality text data for LLMs using FineWeb

The Ultra-Scale Playbook

🌌

3.63k

The ultimate guide to training LLM on large GPU Clusters

The Smol Training Playbook

📚

2.82k

The secrets to building world-class LLMs

upvoted a paper 10 days ago

ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

Paper • 1910.02054 • Published Oct 4, 2019 • 9

upvoted 2 collections about 1 month ago

Journal Club

Collection

Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21, 2024 • 36

Papers We've Read

Collection

Papers discussed in the H4 journal club • 3 items • Updated Apr 12, 2024 • 10

New activity in google/gemma-3-270m 2 months ago

NaNs when fine-tuning

#4 opened 5 months ago by cbudd

liked a model 5 months ago

Qwen/Qwen-Image-Edit

Image-to-Image • Updated Aug 25, 2025 • 46.9k • 2.25k

upvoted 2 papers 5 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 291

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43

liked 4 models 7 months ago

liked a model 8 months ago

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • 685B • Updated Apr 30, 2025 • 588 • 815

updated a model 9 months ago

aryanagal/parler-tts-mini-v1-Jenny-colab

Text Generation • 0.9B • Updated Apr 15, 2025 • 1

published a model 9 months ago

aryanagal/parler-tts-mini-v1-Jenny-colab

Text Generation • 0.9B • Updated Apr 15, 2025 • 1

updated a dataset 9 months ago

aryanagal/jenny-tts-6h-descriptions-v1

Viewer • Updated Apr 14, 2025 • 4k • 6

published a dataset 9 months ago

aryanagal/jenny-tts-6h-descriptions-v1

Viewer • Updated Apr 14, 2025 • 4k • 6

updated a dataset 9 months ago

aryanagal/jenny-tts-text-tags-6h-v1

Viewer • Updated Apr 14, 2025 • 4k • 4
