2 83 16

Yuxin Zuo

yuxinzuo

AI & ML interests

None yet

Recent Activity

upvoted a paper 44 minutes ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

upvoted a paper 4 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

liked a model 7 days ago

lllyx/Qwen3-4B-Base-GRPO

View all activity

Organizations

None yet

upvoted a paper 44 minutes ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 1 day ago • 16

upvoted a paper 4 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 6 days ago • 149

liked 2 models 7 days ago

lllyx/Qwen3-4B-Base-GRPO

Text Generation • 4B • Updated 16 days ago • 187 • 3

lllyx/Qwen3-1.7B-SFT

Text Generation • 2B • Updated 7 days ago • 645 • 3

upvoted a paper 10 days ago

MiA-Signature: Approximating Global Activation for Long-Context Understanding

Paper • 2605.06416 • Published 12 days ago • 54

upvoted a paper 22 days ago

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Paper • 2604.22446 • Published 25 days ago • 121

liked 2 models 26 days ago

Qwen/Qwen3.6-27B

Image-Text-to-Text • 28B • Updated 25 days ago • 3.55M • • 1.33k

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated 25 days ago • 5.61M • • 1.82k

upvoted a paper about 1 month ago

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization

Paper • 2604.12290 • Published Apr 14 • 16

authored 3 papers about 1 month ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 27

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 95

upvoted 4 papers about 1 month ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 95

upvoted 3 papers 2 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 184

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 426

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

upvoted a paper 3 months ago

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published Feb 28 • 64

Yuxin Zuo

AI & ML interests

Recent Activity

Organizations

yuxinzuo's activity