The Agent's First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios • Paper • 2601.08173 • Published 14 days ago • 9
RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection • Paper • 2509.26048 • Published Sep 30, 2025 • 7
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks • Paper • 2510.08002 • Published Oct 9, 2025 • 23
TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving • Paper • 2504.15780 • Published Apr 22, 2025 • 6
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning • Paper • 2503.07365 • Published Mar 10, 2025 • 61