Models

152

Full-text search

Active filters: vLLM

mistralai/Mistral-Small-4-119B-2603

119B • Updated 2 days ago • 5.36k • 240

mistralai/Mistral-Small-4-119B-2603-NVFP4

Updated 2 days ago • 723 • 58

mistralai/Mistral-Small-4-119B-2603-eagle

Updated 2 days ago • 188 • 31

unsloth/Mistral-Small-4-119B-2603-GGUF

119B • Updated 2 days ago • 17.9k • 29

QuantTrio/Qwen3.5-27B-AWQ

Image-Text-to-Text • 28B • Updated 18 days ago • 218k • 23

bartowski/mistralai_Mistral-Small-4-119B-2603-GGUF

Image-Text-to-Text • 119B • Updated 2 days ago • 4.8k • 6

QuantTrio/Qwen3.5-122B-A10B-AWQ

Image-Text-to-Text • 125B • Updated 21 days ago • 36.7k • 18

QuantTrio/Qwen3.5-35B-A3B-AWQ

Image-Text-to-Text • 36B • Updated 21 days ago • 121k • 11

unsloth/Mistral-Small-4-119B-2603

119B • Updated 3 days ago • 146 • 3

cyankiwi/Mistral-Small-4-119B-2603-AWQ-4bit

123B • Updated 1 day ago • 94 • 2

QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix

Text Generation • 253B • Updated Sep 5, 2025 • 45 • 4

QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ

Text Generation • 31B • Updated Oct 8, 2025 • 321k • 41

QuantTrio/Qwen3-VL-32B-Instruct-AWQ

Image-Text-to-Text • 33B • Updated Oct 22, 2025 • 101k • 12

QuantTrio/Kimi-K2.5-E304

Image-Text-to-Text • 138B • Updated Feb 2 • 16.4k • 3

QuantTrio/Qwen3.5-9B-AWQ

Image-Text-to-Text • 10B • Updated 16 days ago • 73.1k • 4

mlx-community/Mistral-Small-4-119B-2603-4bit

19B • Updated 1 day ago • 825 • 1

inferencerlabs/Mistral-Small-4-119B-2603-MLX-9bit

Text Generation • 38B • Updated about 2 hours ago • 371 • 1

model-scope/glm-4-9b-chat-GPTQ-Int4

Text Generation • 9B • Updated Jul 17, 2024 • 52 • 6

model-scope/glm-4-9b-chat-GPTQ-Int8

Text Generation • 9B • Updated Jul 23, 2024 • 14 • 2

tclf90/qwen2.5-72b-instruct-gptq-int4

Text Generation • 73B • Updated May 12, 2025 • 117 • 2

tclf90/qwen2.5-72b-instruct-gptq-int3

Text Generation • 69B • Updated May 12, 2025 • 148

prithivMLmods/Nu2-Lupi-Qwen-14B

Text Generation • 15B • Updated Mar 27, 2025 • 2 • 2

mradermacher/Nu2-Lupi-Qwen-14B-GGUF

15B • Updated Jul 11, 2025 • 201 • 1

mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF

15B • Updated Jul 11, 2025 • 97 • 1

JunHowie/Qwen3-0.6B-GPTQ-Int4

Text Generation • 0.6B • Updated Sep 3, 2025 • 147 • 1

JunHowie/Qwen3-0.6B-GPTQ-Int8

Text Generation • 0.6B • Updated Sep 3, 2025 • 15

JunHowie/Qwen3-1.7B-GPTQ-Int4

Text Generation • 2B • Updated Sep 3, 2025 • 414 • 1

JunHowie/Qwen3-1.7B-GPTQ-Int8

Text Generation • 2B • Updated Sep 3, 2025

JunHowie/Qwen3-32B-GPTQ-Int4

Text Generation • 33B • Updated Sep 5, 2025 • 3.19k • 4

JunHowie/Qwen3-32B-GPTQ-Int8

Text Generation • 33B • Updated Sep 5, 2025 • 1.77k • 4