Predicting LLM Reasoning Performance with Small Proxy Model Paper • 2509.21013 • Published Sep 25, 2025 • 1 • 2
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration Paper • 2509.14760 • Published Sep 18, 2025 • 53 • 3
AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs Paper • 2509.08031 • Published Sep 9, 2025 • 21 • 3
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model Paper • 2509.09372 • Published Sep 11, 2025 • 243 • 7
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30, 2025 • 277 • 9
BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training Paper • 2409.04599 • Published Sep 6, 2024 • 2 • 2
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 287 • 44
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks Paper • 2504.07835 • Published Apr 10, 2025 • 2
Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities Paper • 2505.01043 • Published May 2, 2025 • 10 • 3
A Survey on Post-training of Large Language Models Paper • 2503.06072 • Published Mar 8, 2025 • 10 • 2