Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability Paper • 2602.02477 • Published 7 days ago • 9
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training Paper • 2602.01511 • Published 8 days ago • 14
AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration Paper • 2602.03786 • Published 6 days ago • 83
SpatiaLab: Can Vision-Language Models Perform Spatial Reasoning in the Wild? Paper • 2602.03916 • Published 6 days ago • 11
Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published 12 days ago • 15
Balancing Understanding and Generation in Discrete Diffusion Models Paper • 2602.01362 • Published 8 days ago • 14
WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models Paper • 2602.02537 • Published 13 days ago • 5
MARS: Modular Agent with Reflective Search for Automated AI Research Paper • 2602.02660 • Published 7 days ago • 59
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs Paper • 2602.02103 • Published 7 days ago • 66
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation Paper • 2602.01756 • Published 8 days ago • 22
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published 7 days ago • 31
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 9 days ago • 265
Toward Cognitive Supersensing in Multimodal Large Language Model Paper • 2602.01541 • Published 8 days ago • 16
TTCS: Test-Time Curriculum Synthesis for Self-Evolving Paper • 2601.22628 • Published 11 days ago • 33
ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought Paper • 2601.23184 • Published 10 days ago • 34
ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas Paper • 2601.21558 • Published 12 days ago • 58
AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning Paper • 2601.18631 • Published 14 days ago • 47