Training datasets and released checkpoints for RLVR from the CVPR 2026 paper “ReLaX: Reasoning with Latent Exploration for Large Reasoning Models”
Shimin Zhang
SteveZ25
Recent Activity
updated a dataset 3 days ago: SteveZ25/MMLU-VeRL
published a dataset 3 days ago: SteveZ25/MMLU-VeRL
updated a dataset 5 days ago: SteveZ25/CSQA