Papers
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
Models (10)
OpenRubrics/RubricRM-4B-Rubric
196k • Updated • 6
OpenRubrics/RubricRM-4B-Judge
196k • Updated • 11
OpenRubrics/RubricRM-4B-Rubric-v2
196k • Updated • 8
OpenRubrics/RubricRM-8B-Judge
308k • Updated • 8
OpenRubrics/RubricRM-8B-Rubric
308k • Updated • 17
OpenRubrics/RubricRM-8B-Judge-v2
308k • Updated • 19
OpenRubrics/RubricRM-8B-Rubric-v2
308k • Updated • 20
OpenRubrics/RubricRM-4B-Judge-v2
196k • Updated • 4 • 1
OpenRubrics/RubricARM-8B-Rubric
308k • Updated • 13 • 3
OpenRubrics/RubricARM-8B-Judge
308k • Updated • 146 • 3