arxiv:2510.25741
Ziniu Li
ziniuli
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 hours ago
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method
for Aligning Large Language Models
upvoted
a
paper
about 2 hours ago
Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
upvoted
a
paper
7 days ago
How Far Are We from Genuinely Useful Deep Research Agents?