ArxivBench: Can LLMs Assist Researchers in Conducting Research? Paper • 2504.10496 • Published Apr 6, 2025
Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games Paper • 2510.26298 • Published Oct 30, 2025