mradermacher/SEOcrate-4B_grpo_new_01-GGUF Reinforcement Learning • 4B • Updated Jul 11, 2025 • 166 • 1
mradermacher/SEOcrate-4B_grpo_new_01-i1-GGUF Reinforcement Learning • 4B • Updated Jul 11, 2025 • 917
Inigomf/Llama-3.1-8B-FinAdvisor-MechInterp-DO-NOT-USE-FOR-FINANCIAL-ADVICE Text Generation • 8B • Updated Feb 11 • 3