Zhicheng YANG's picture

Zhicheng YANG

yangzhch6

·

https://yangzhch6.github.io/

yangzhch6

AI & ML interests

reasoning with LLMs

Recent Activity

updated a model about 9 hours ago

yangzhch6/tool-verl-qwen1.5B-200step-0320

published a model about 9 hours ago

yangzhch6/tool-verl-qwen1.5B-200step-0320

updated a model 6 days ago

yangzhch6/maxrl-qwen3-4b-base-dapo-bs128-n16-stepp400

View all activity

Organizations

None yet

Collections 2

Papers 7

arxiv:2508.13755

arxiv:2506.05183

arxiv:2407.09887

arxiv:2406.01940

models 33

yangzhch6/tool-verl-qwen1.5B-200step-0320

2B • Updated about 9 hours ago

yangzhch6/maxrl-qwen3-4b-base-dapo-bs128-n16-stepp400

4B • Updated 6 days ago • 14

yangzhch6/Qwen2.5-Math-7B-Think32k

Text Generation • 8B • Updated 15 days ago • 15

yangzhch6/Qwen2.5-Math-7B-Think32k-Openr1ColdStart46k-Syn

333k • Updated 15 days ago • 13

yangzhch6/Qwen2.5-Math-7B-Think32k-Openr1ColdStart46k

333k • Updated 16 days ago • 11

yangzhch6/Qwen2.5-Math-7B-16k-Think-Synthesizer

8B • Updated Nov 10, 2025

yangzhch6/cuda-12.8-tar

Updated Oct 13, 2025

yangzhch6/cuda-12.8

Updated Oct 13, 2025

yangzhch6/Mirror-Verifier-1.5B

2B • Updated Sep 30, 2025

yangzhch6/Mirror-Verifier-7B

8B • Updated Sep 30, 2025

datasets 16

yangzhch6/Accordion-Thinking-Synthetic-Data

Viewer • Updated Feb 10 • 14.7k • 12

yangzhch6/DeepInformal-DeepTheorem-Synthetic

Viewer • Updated Nov 10, 2025 • 404k • 25 • 1

yangzhch6/DeepInformal-Openr1-Math-46K-Synthetic

Viewer • Updated Nov 10, 2025 • 165k • 19

yangzhch6/compare-openr1

Viewer • Updated Nov 10, 2025 • 45.8k • 8

yangzhch6/Align-Openr1-Math-46k

Viewer • Updated Nov 9, 2025 • 45.8k • 10

yangzhch6/DeepInformal-test

Viewer • Updated Oct 29, 2025 • 405 • 14

yangzhch6/DeepInformal-Putnam-1995-2024

Viewer • Updated Oct 28, 2025 • 356 • 7 • 1

yangzhch6/DeepInformal-DeepTheorem-DeepSeek-84k

Viewer • Updated Oct 28, 2025 • 84.1k • 21

yangzhch6/Putnam-Informal-1995-2024

Viewer • Updated Oct 27, 2025 • 360 • 11 • 1

yangzhch6/cuda-12.8-tar

Updated Oct 13, 2025 • 5

View 16 datasets