Inference Providers
Active filters: vLLM
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 5.36k
• 240
mistralai/Mistral-Small-4-119B-2603-NVFP4
Updated • 723
• 58
mistralai/Mistral-Small-4-119B-2603-eagle
Updated • 188
• 31
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 17.9k
• 29
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 218k
• 23
bartowski/mistralai_Mistral-Small-4-119B-2603-GGUF
Image-Text-to-Text
• 119B • Updated • 4.8k
• 6
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated • 36.7k
• 18
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 121k
• 11
unsloth/Mistral-Small-4-119B-2603
119B • Updated • 146
• 3
cyankiwi/Mistral-Small-4-119B-2603-AWQ-4bit
123B • Updated • 94
• 2
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
• 253B • Updated • 45
• 4
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 321k
• 41
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
• 33B • Updated • 101k
• 12
Image-Text-to-Text
• 138B • Updated • 16.4k
• 3
Image-Text-to-Text
• 10B • Updated • 73.1k
• 4
mlx-community/Mistral-Small-4-119B-2603-4bit
19B • Updated • 825
• 1
inferencerlabs/Mistral-Small-4-119B-2603-MLX-9bit
Text Generation
• 38B • Updated • 371
• 1
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 52
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 14
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 117
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 148
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 2
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 201
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 97
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 147
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 15
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 414
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 3.19k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 1.77k
• 4