Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 25 days ago • 79
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence Paper • 2604.07296 • Published 28 days ago • 39
The Art of Efficient Reasoning Collection Project: https://wutaiqiang.github.io/project/Art • 8 items • Updated Mar 18 • 2
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning Paper • 2603.00889 • Published Mar 1 • 56