Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 119
Article The Great Classification Showdown: OSS vs BERT on Consumer Hardware 17 days ago • 12
Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family 24 days ago • 81
Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 106
Article Why You Should Care About Partial Differential Equations (PDEs) Dec 12, 2025 • 41
Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day Dec 8, 2025 • 52
Article Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing Nov 11, 2025 • 12
abliterated-v3 Collection Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 137
Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... Jan 20, 2025 • 76
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10, 2025 • 111