Article How to generate text: using different decoding methods for language generation with Transformers patrickvonplaten • Mar 1, 2020 • 297
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 505
Gemma Scope Release Collection A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Mar 12 • 22
RM Sycophancy (LLaMa) Collection https://alignment.anthropic.com/2025/auditing-mo-replication/ • 9 items • Updated Feb 15 • 2
Llama 2 Family Collection This collection hosts the transformers and original repos of the Llama 2 and Llama Guard releases • 13 items • Updated Dec 6, 2024 • 99
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 89
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21, 2025 • 273
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published Aug 2, 2025 • 240
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos Paper • 2508.14041 • Published Aug 19, 2025 • 59
Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 119
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 Paper • 2408.05147 • Published Aug 9, 2024 • 41
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24, 2025 • 51
Article Learn the Hugging Face Kernel Hub in 5 Minutes +5 drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning Paper • 2505.07291 • Published May 12, 2025 • 15
Article LeRobot Community Datasets: The “ImageNet” of Robotics – When and How? +5 danaaubakirova, Beegbrain, mshukor, m1b, villekuosmanen, cadene, pcuenq • May 11, 2025 • 97
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 159