Visually-Guided Policy Optimization for Multimodal Reasoning Paper • 2604.09349 • Published Apr 10