SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 150k • 946 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 166k • 730 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 27.8k • 585 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 199k • 186
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 515k • 1.05k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 35.3k • • 211 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 447 • 23 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 10.7k • 185
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 35.3k • • 211
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 11.5k • 670 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 14.9k • 988 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 90.8k • 219 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 14.1k • 2.94k
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 600 • 76 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 102 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 615 • 105 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 22.1k • 826
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 20.3k • 688 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 1.68k • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 555k • 3.45k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 315 • 134
SmolLM 🤏 SmolLM models, datasets and demos HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 150k • 946 HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 166k • 730 HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 27.8k • 585 HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 199k • 186
Instruct datasets QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 600 • 76 arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 102 • 6 arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 615 • 105 teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 22.1k • 826
📚 Filtering the web with LLMs HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 515k • 1.05k HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 35.3k • • 211 HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 447 • 23 math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 10.7k • 185
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 35.3k • • 211
🌌 Synthetic textbooks Synthetically generated textbooks HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 20.3k • 688 Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 1.68k • 198 microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 555k • 3.45k HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 315 • 134
✨ Code Generation Code generation models and datassets! bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 11.5k • 670 bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 14.9k • 988 bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 90.8k • 219 bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 14.1k • 2.94k