Wentian Zhao's picture

7

Wentian Zhao

zwt123home123

·

[email protected]

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

upvoted a paper 2 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

upvoted a paper 2 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

View all activity

Organizations

None yet

Papers 2

arxiv:2504.09710

arxiv:2410.06169

models 116

zwt123home123/code_log_3

zwt123home123/reproduce_log

zwt123home123/code_log_2

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_320_actor

8B • Updated Apr 3 • 3

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_203_actor

8B • Updated Apr 3 • 2

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-3ppl_largebs_global_step_203_actor

8B • Updated Apr 3 • 3

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-3ppl_largebs_global_step_400_actor

8B • Updated Apr 3 • 3

zwt123home123/global_step_840_actor

8B • Updated Apr 2 • 3

zwt123home123/InternVL2_5-8B

Image-Text-to-Text • 8B • Updated Feb 19 • 15

zwt123home123/KV_internvl26b

View 116 models

datasets 2

zwt123home123/code_log_2

Updated May 12 • 8

zwt123home123/code_log

Updated May 12 • 7