4 15 42

sunyuhan

yuuhan

sunyuhan19981208

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

upvoted a paper 2 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper 2 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

View all activity

Organizations

upvoted 5 papers 2 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published 12 days ago • 97

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 8 days ago • 80

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 15 days ago • 244

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 27 days ago • 194

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 6 days ago • 181

upvoted a paper 6 days ago

A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

Paper • 2510.12838 • Published Oct 13 • 24

upvoted a paper about 2 months ago

BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese

Paper • 2504.19314 • Published Apr 27 • 7

upvoted a paper 3 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

upvoted a paper 4 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

upvoted an article 5 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

•

996

upvoted 2 papers 6 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 103

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 142

upvoted a paper 7 months ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22 • 64

liked a dataset 10 months ago

HuggingFaceH4/numina-deepseek-r1-qwen-7b

Viewer • Updated Jan 25 • 40 • 50 • 39

upvoted an article 10 months ago

Article

Open R1: Update #2

Feb 10

•

218

liked a dataset about 1 year ago

shibing624/roleplay-zh-sharegpt-gpt4-data

Viewer • Updated Jun 26, 2024 • 6.58k • 209 • 65

liked a model about 1 year ago

dnhkng/RYS-XLarge

Text Generation • 78B • Updated Oct 11, 2024 • 111 • 85

upvoted a collection over 1 year ago

GLM-4

Collection

GLM-4 Open Models • 14 items • Updated Jun 30 • 125

liked a model over 1 year ago

BAAI/bge-m3

liked a Space over 1 year ago

MTEB Leaderboard

🥇

6.79k

Embedding Leaderboard

sunyuhan

AI & ML interests

Recent Activity

Organizations

yuuhan's activity

Mixture of Experts Explained

Open R1: Update #2

MTEB Leaderboard