Liam
LiLiam
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Low-probability Tokens Sustain Exploration in Reinforcement Learning
with Verifiable Reward
upvoted
a
paper
6 months ago
xbench: Tracking Agents Productivity Scaling with Profession-Aligned
Real-World Evaluations
upvoted
a
paper
6 months ago
The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in
Learning to Reason