OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents • Paper • 2605.05185 • Published 13 days ago • 97
From Context to Skills: Can Language Models Learn from Context Skillfully? • Paper • 2604.27660 • Published 16 days ago • 157
Heterogeneous Scientific Foundation Model Collaboration • Paper • 2604.27351 • Published 19 days ago • 215
SWE-chat: Coding Agent Interactions From Real Users in the Wild • Paper • 2604.20779 • Published 27 days ago • 15
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents • Paper • 2604.07429 • Published Apr 8 • 121
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time • Paper • 2604.11626 • Published Apr 13 • 102
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability • Paper • 2604.06628 • Published Apr 8 • 324