Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Agent-RL's picture
4 5

Agent-RL

agentrl
xinrihui's profile picture kristaller486's profile picture wul8's profile picture
·
https://github.com/Agent-RL
  • agentrl

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 7 months ago

Surrogate Signals from Format and Length: Reinforcement Learning for Solving Mathematical Problems without Ground Truth Answers

Paper • 2505.19439 • Published May 26, 2025 • 30
upvoted an article 7 months ago
view article
Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30, 2025
•
81
upvoted a collection 9 months ago

ReSearch

Collection
Trained models as described in the paper "ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning" • 5 items • Updated Mar 27, 2025 • 7
upvoted a paper 9 months ago

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Paper • 2503.19470 • Published Mar 25, 2025 • 19
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs