[ICLR 2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
Zengbin Wang
MuMing0102
·
AI & ML interests
Agentic AI, Multimodal LLM, Computer Vision
Recent Activity
authored a paper 1 day ago
Visually-Guided Policy Optimization for Multimodal Reasoning updated a collection 2 days ago
SpatialGenEval updated a collection 2 days ago
VGPO-RLOrganizations
None yet