PC-GRPO Collection Qwen2.5-VL-3B & 7B models trained with PC-GRPO in the paper: Puzzle Curriculum GRPO for Vision-Centric Reasoning • 9 items • Updated about 1 month ago • 3