- SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
  Paper • 2509.02479 • Published • 83
- WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
  Paper • 2509.06501 • Published • 79
- UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
  Paper • 2509.02544 • Published • 124
- Baichuan-M2: Scaling Medical Capability with Large Verifier System
  Paper • 2509.02208 • Published • 42
Henry Mayo
hal90000
AI & ML interests
I want to have it all
Recent Activity
liked a model about 2 months ago: WeiboAI/VibeThinker-1.5B
liked a model about 2 months ago: zai-org/GLM-4.6-FP8
liked a model 2 months ago: MiniMaxAI/MiniMax-M2
Organizations
None yet