Xi'an Jiaotong University

university

http://en.xjtu.edu.cn/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

xufangzhi submitted a paper 2 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

xufangzhi authored a paper 4 days ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

xufangzhi authored a paper 4 days ago

A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction

View all activity

xufangzhi

submitted a paper to Daily Papers 2 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 6 days ago • 54

xufangzhi

authored 11 papers 4 days ago

MUR: Momentum Uncertainty guided Reasoning for Large Language Models

Paper • 2507.14958 • Published Jul 20, 2025 • 47

A Semantic Mention Graph Augmented Model for Document-Level Event Argument Extraction

Paper • 2403.09721 • Published Mar 12, 2024 • 1

Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning

Paper • 2507.21892 • Published Jul 29, 2025 • 3

ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding

Paper • 2505.19076 • Published May 25, 2025

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

$A^3$-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Paper • 2601.09274 • Published 28 days ago • 84

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Paper • 2601.07779 • Published 29 days ago • 28

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 12 days ago • 12

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 9 days ago • 32

Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over Text

Paper • 2301.02983 • Published Jan 8, 2023

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 6 days ago • 54

MichaelErchi

authored 2 papers 3 months ago

How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity

Paper • 2511.08487 • Published Nov 11, 2025 • 3

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published Nov 18, 2025 • 17

JERRYPAN617

authored a paper 3 months ago

CF-CAM: Cluster Filter Class Activation Mapping for Reliable Gradient-Based Interpretability

Paper • 2504.00060 • Published Mar 31, 2025 • 1

qizekun

authored a paper 4 months ago

Reasoning in Space via Grounding in the World

Paper • 2510.13800 • Published Oct 15, 2025 • 15

MichaelErchi

authored a paper 6 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 269

MichaelErchi

authored 2 papers 7 months ago

Rethinking Verification for LLM Code Generation: From Generation to Testing

Paper • 2507.06920 • Published Jul 9, 2025 • 29

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published Jul 8, 2025 • 22

qizekun

authored a paper 7 months ago

DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge

Paper • 2507.04447 • Published Jul 6, 2025 • 45