arxiv:2506.06395
Li Pengyi
LiPengyi29
AI & ML interests
None yet
Recent Activity
upvoted a paper 13 days ago
Breaking the Chain: A Causal Analysis of LLM Faithfulness to Intermediate Structures upvoted a paper 30 days ago
How Far Can Unsupervised RLVR Scale LLM Training? upvoted a paper about 2 months ago
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities