view article Article SyGra: The One-Stop Framework for Building Data for LLMs and SLMs Sep 22, 2025 • 14
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 13 days ago • 47
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 4 days ago • 34
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 70
Running on CPU Upgrade Featured 3.1k The Smol Training Playbook 📚 3.1k The secrets to building world-class LLMs
Running 3.77k The Ultra-Scale Playbook 🌌 3.77k The ultimate guide to training LLM on large GPU Clusters