view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 115
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 • 106
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 79
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 24 items • Updated 2 days ago • 90
Granite Quantized Models Collection Quantized versions of IBM Granite models. Licensed under the Apache 2.0 license. • 44 items • Updated Nov 21, 2025 • 31
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7, 2025 • 82
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 25 days ago • 162
Josiefied and Abliterated Qwen3 Collection Abliterated, and further fine-tuned to be the most uncensored models available. • 18 items • Updated 3 days ago • 31