Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published Feb 11 • 17
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published Feb 11 • 17
Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs Paper • 2512.08923 • Published Dec 9, 2025 • 1
Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs Paper • 2512.08923 • Published Dec 9, 2025 • 1
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning Paper • 2507.14137 • Published Jul 18, 2025 • 36
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning Paper • 2507.14137 • Published Jul 18, 2025 • 36
KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11, 2025 • 40
KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11, 2025 • 40
Small Visual Language Models can also be Open-Ended Few-Shot Learners Paper • 2310.00500 • Published Sep 30, 2023
A critical analysis of self-supervision, or what we can learn from a single image Paper • 1904.13132 • Published Apr 30, 2019