MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 506k • 353
unsloth/Qwen3-VL-2B-Instruct-GGUF Image-Text-to-Text • 2B • Updated Oct 31, 2025 • 32.9k • 31
Qwen3-VL Collection Qwen's new multimodal vision models in GGUF, safetensors, and dynamic Unsloth formats. • 56 items • Updated 20 days ago • 33
Qwen3-VL Multimodal Search Engine 🔥 Cross-modal text-image search powered by Qwen3-VL • Running on Zero • Agents • 15
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 166
MiniCPM-o & MiniCPM-V Collection Multimodal models with leading performance. • 32 items • Updated about 12 hours ago • 81