
Jianhao Yan

Elliott
  • ElliottYan

AI & ML interests

None yet

Organizations

None yet

commented a paper 8 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88 • 6
commented a paper 9 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 98 • 15
New activity in Elliott/Qwen2.5-Math-7B-16k-think 9 months ago

Add library name, pipeline tag, link to Github

#1 opened 9 months ago by nielsr
New activity in Elliott/Openr1-Math-46k-8192 9 months ago

Add task category

#2 opened 9 months ago by nielsr
New activity in Elliott/LUFFY-Qwen-Math-7B-Zero 9 months ago

Correct pipeline tag and add Github link

#1 opened 9 months ago by nielsr
commented 2 papers 9 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88 • 6