Ziniu Li's picture

5 24 5

Ziniu Li

ziniuli

·

http://www.liziniu.org/

liziniu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

upvoted a paper about 2 hours ago

Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning

upvoted a paper 7 days ago

How Far Are We from Genuinely Useful Deep Research Agents?

View all activity

Organizations

Papers 13

arxiv:2510.25741

arxiv:2509.25849

arxiv:2508.17445

arxiv:2508.09099

models 1

ziniuli/Mistral-7B-ReMax-v0.1

Text Generation • 7B • Updated Feb 29, 2024 • 52 • 4

datasets 1

ziniuli/rollout

Updated 12 days ago • 126