Anwar

abdoali5672

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

upvoted a paper 19 days ago

Same Architecture, Different Capacity: Optimizer-Induced Spectral Scaling Laws

upvoted a paper 20 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Organizations

None yet

commented on 2 papers 9 months ago

Predicting LLM Reasoning Performance with Small Proxy Model

Paper • 2509.21013 • Published Sep 25, 2025 • 6

Direct Multi-Token Decoding

Paper • 2510.11958 • Published Oct 13, 2025 • 9

commented on 3 papers 10 months ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation

Paper • 2509.14760 • Published Sep 18, 2025 • 53

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 259

commented on 7 papers about 1 year ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17, 2025 • 14

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 283

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training

Paper • 2409.04599 • Published Sep 6, 2024 • 2

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 289

Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks

Paper • 2504.07835 • Published Apr 10, 2025 • 1

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities

Paper • 2505.01043 • Published May 2, 2025 • 10

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published Mar 8, 2025 • 11