Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published 5 days ago • 3
Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published 5 days ago • 3
Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published 5 days ago • 3 • 2
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024 • 6
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Paper • 2412.06781 • Published Dec 9, 2024 • 24
Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion Paper • 2412.09593 • Published Dec 12, 2024 • 18
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024 • 6
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024 • 6 • 2
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel Paper • 2412.08467 • Published Dec 11, 2024 • 6
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search Paper • 2408.08152 • Published Aug 15, 2024 • 60
view article Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive Apr 9, 2024 • 30