HuangLab Test

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

russwang authored a paper about 1 month ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Benjamin-eecs authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

russwang authored a paper 2 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

View all activity

russwang

authored a paper about 1 month ago

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following

Paper • 2511.21662 • Published Nov 26, 2025 • 11

Benjamin-eecs

authored a paper about 2 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 81

russwang

authored a paper 2 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 31

Benjamin-eecs

authored a paper 2 months ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28, 2025 • 17

Benjamin-eecs

authored 2 papers 3 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 36

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

russwang

authored 2 papers 3 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published Sep 28, 2025 • 47

Benjamin-eecs

authored 3 papers 3 months ago

Benjamin-eecs

authored a paper 4 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 84

russwang

authored 3 papers 4 months ago

What makes Reasoning Models Different? Follow the Reasoning Leader for Efficient Decoding

Paper • 2506.06998 • Published Jun 8, 2025 • 1

CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning

Paper • 2507.00045 • Published Jun 23, 2025 • 1

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 84

haitaominlp

authored a paper 4 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

freesunshine0316

authored 4 papers 6 months ago

The Trickle-down Impact of Reward (In-)consistency on RLHF

Paper • 2309.16155 • Published Sep 28, 2023 • 1

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30, 2024 • 7

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving

Paper • 2412.20735 • Published Dec 30, 2024 • 11

AI & ML interests

Recent Activity

Team members 6

RussWang96's activity