I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Paper • 2503.18878 • Published Mar 24, 2025 • 119
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls Paper • 2510.00184 • Published Sep 30, 2025 • 16
Mechanistic Interpretability of Large-Scale Counting in LLMs through a System-2 Strategy Paper • 2601.02989 • Published 3 days ago • 4