39 36 21

Denis Kuznedelev

SpiridonSunRotator

https://github.com/Godofnothing

Godofnothing

AI & ML interests

Model compression, computer vision, NLP

Recent Activity

liked a model 9 days ago

AlexWortega/ml-intern-v4-100m-tinystories-20260512-1721

upvoted a paper 11 days ago

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

upvoted a paper about 2 months ago

Reasoning Shift: How Context Silently Shortens LLM Reasoning

View all activity

Organizations

upvoted a paper 11 days ago

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Paper • 2605.07850 • Published 15 days ago • 18

upvoted a paper about 2 months ago

Reasoning Shift: How Context Silently Shortens LLM Reasoning

Paper • 2604.01161 • Published Apr 1 • 32

upvoted an article 3 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

•

Jan 27

• 75

upvoted 3 papers 3 months ago

LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding

Paper • 2602.23881 • Published Feb 27 • 18

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Paper • 2602.03537 • Published Feb 3 • 5

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Paper • 2602.02016 • Published Feb 2 • 13

upvoted a paper 4 months ago

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Paper • 2601.22813 • Published Jan 30 • 61

upvoted 2 papers 6 months ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

upvoted an article 6 months ago

Article

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

NormalUhr

•

Feb 4, 2025

• 35

upvoted an article 7 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

abidlabs, znation, nouamanetazi, sasha, qgallouedec

•

Jul 29, 2025

• 223

upvoted a paper 8 months ago

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Paper • 2509.23202 • Published Sep 27, 2025 • 30

upvoted 2 papers 10 months ago

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published Jul 24, 2025 • 42

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 126

upvoted a paper 11 months ago

MADrive: Memory-Augmented Driving Scene Modeling

Paper • 2506.21520 • Published Jun 26, 2025 • 36

upvoted 5 papers 12 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7, 2025 • 71

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27, 2025 • 144

Unified Scaling Laws for Compressed Representations

Paper • 2506.01863 • Published Jun 2, 2025 • 19

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published May 25, 2025 • 85

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

Paper • 2505.16134 • Published May 22, 2025 • 18

Denis Kuznedelev

AI & ML interests

Recent Activity

Organizations

SpiridonSunRotator's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face