Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation Paper • 2602.12125 • Published Feb 12 • 60 • 3
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 22 days ago • 147 • 5
UniG2U-Bench: Do Unified Models Advance Multimodal Understanding? Paper • 2603.03241 • Published 21 days ago • 86 • 4
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 28 days ago • 99 • 4
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published Feb 19 • 56 • 4
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models Paper • 2602.22859 • Published 26 days ago • 151 • 4
The Trinity of Consistency as a Defining Principle for General World Models Paper • 2602.23152 • Published 26 days ago • 199 • 5
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 27 days ago • 47 • 5
Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter Paper • 2511.16665 • Published Nov 20, 2025 • 1
SkillOrchestra: Learning to Route Agents via Skill Transfer Paper • 2602.19672 • Published 29 days ago • 56 • 6
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing Paper • 2603.00141 • Published 28 days ago • 138 • 5
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 21 days ago • 188 • 7
RubricBench: Aligning Model-Generated Rubrics with Human Standards Paper • 2603.01562 • Published 22 days ago • 60 • 4
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 21 days ago • 100 • 6
Helios: Real Real-Time Long Video Generation Model Paper • 2603.04379 • Published 20 days ago • 174 • 6