llm-rl - a Sha-Andrey Collection

updated Apr 30, 2025

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139