tencent/Hy-MT1.5-1.8B-1.25bit
Translation • 2B • Updated • 459 • 31
None defined yet.
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement