baidu/ERNIE-4.5-VL-28B-A3B-Thinking • Image-Text-to-Text • 30B • Updated Dec 24, 2025 • 1.27k • 518
nvidia/nemo-nano-codec-22khz-1.89kbps-21.5fps • Feature Extraction • Updated Aug 30, 2025 • 1.6k • 9
bosonai/higgs-audio-v2-generation-3B-base • Text-to-Speech • 6B • Updated Jul 28, 2025 • 147k • 657
mistralai/Voxtral-Small-24B-2507 • Audio-Text-to-Text • 24B • Updated Dec 20, 2025 • 70.2k • 445