Qi Chen-SII

qc316

https://scholar.google.com/citations?user=NPePREMAAAAJ

Moon0316

AI & ML interests

SII is an institution dedicated to innovation in AI education and research. Qi Chen is a member of SII and focuses on multimodal learning.

Recent Activity

upvoted a paper 23 days ago

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 23 days ago • 73

upvoted a paper 24 days ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published 26 days ago • 29

upvoted 2 papers about 1 month ago

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published Dec 2, 2025 • 21

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

upvoted a paper 2 months ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 18

upvoted a paper 3 months ago

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Paper • 2510.18701 • Published Oct 21, 2025 • 66

liked a Space 5 months ago

3DGen Leaderboard

😻

Display 3D model evaluation leaderboard

upvoted 3 papers 5 months ago

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Paper • 2508.05609 • Published Aug 7, 2025 • 29

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6, 2025 • 52

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5, 2025 • 51

upvoted 2 papers 7 months ago

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better

Paper • 2506.09040 • Published Jun 10, 2025 • 34

Video World Models with Long-term Spatial Memory

Paper • 2506.05284 • Published Jun 5, 2025 • 55

upvoted a paper 8 months ago

Visual Agentic Reinforcement Fine-Tuning

Paper • 2505.14246 • Published May 20, 2025 • 32

upvoted a collection 8 months ago

UnifiedReward 1.0 Qwen2.5VL Models

Collection

6 items • Updated Nov 6, 2025 • 10

upvoted a paper 8 months ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6, 2025 • 92

upvoted 2 papers 10 months ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27, 2025 • 33

CineBrain: A Large-Scale Multi-Modal Brain Dataset During Naturalistic Audiovisual Narrative Processing

Paper • 2503.06940 • Published Mar 10, 2025 • 11

upvoted a collection 10 months ago

UnifiedReward Training Data

Collection

14 items • Updated Nov 6, 2025 • 6

liked a dataset about 1 year ago

Fudan-fMRI/fMRI-Shape

Viewer • Updated Aug 15, 2025 • 1.4k • 3.5k • 10

upvoted a paper about 1 year ago

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 36
