view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 261
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains Paper • 2511.04962 • Published Nov 7, 2025 • 53
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 2.76k The Smol Training Playbook 📚 2.76k The secrets to building world-class LLMs
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 158
DiffGuard: Text-Based Safety Checker for Diffusion Models Paper • 2412.00064 • Published Nov 25, 2024 • 3
DiffGuard: Text-Based Safety Checker for Diffusion Models Paper • 2412.00064 • Published Nov 25, 2024 • 3