mradermacher/A2Search-3B-Instruct-i1-GGUF • Reinforcement Learning • 3B • Updated Dec 4, 2025 • 269 • 1