Running on CPU Upgrade Featured 3.21k The Smol Training Playbook 📚 3.21k The secrets to building world-class LLMs
Running 3.9k The Ultra-Scale Playbook 🌌 3.9k The ultimate guide to training LLM on large GPU Clusters
Running 601 Scaling test-time compute 📈 601 Boost LLM answers with flexible test‑time search strategies
Running Featured 127 Open-LLM performances are plateauing, let’s make the leaderboard steep again 🏔 127 Explore and compare advanced language models on a new leaderboard
Running Agents 231 BigCodeBench Leaderboard 🥇 231 Explore code-generation model leaderboards and task details
Running Agents 232 AI2 WildBench Leaderboard (V2) 🦁 232 Display LLM performance leaderboards with customizable views
Running Featured 1.37k FineWeb: decanting the web for the finest text data at scale 🍷 1.37k Explore and download the FineWeb web‑scale text dataset