Inference Providers
Active filters: VPTQ
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-32768-woft
7B • Updated • 4
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft
2B • Updated • 22
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft
2B • Updated • 2
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft
2B • Updated • 37
• 1
VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-65536-woft
8B • Updated • 16
• 4
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft
8B • Updated • 2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft
9B • Updated • 1
• 1
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft
2B • Updated • 4
VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-32768-woft
8B • Updated • 5
• 3
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k32768-0-woft
6B • Updated • 4
• 1
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft
11B • Updated • 11
• 2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k16384-0-woft
6B • Updated • 1
• 2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-0-woft
7B • Updated • 11
• 2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated
8B • Updated • 1
• 1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-1024-woft
26B • Updated • 8
• 1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k4096-0-woft
23B • Updated • 24
• 1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-64-woft
22B • Updated • 2
• 3
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft
29B • Updated • 4
• 1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-128-woft
23B • Updated • 1
• 1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft
8B • Updated • 3
• 2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-0-woft
7B • Updated • 9
• 2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k512-512-woft
7B • Updated • 5
• 1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k1024-512-woft
8B • Updated • 2
• 2
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-256-woft
24B • Updated • 6
• 1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-256-woft
9B • Updated • 3
• 4
VPTQ-community/Qwen2.5-14B-Instruct-v8-k256-256-woft
2B • Updated VPTQ-community/Qwen2.5-14B-Instruct-v16-k65536-65536-woft
3B • Updated • 1
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-256-woft
3B • Updated VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-0-woft
3B • Updated • 3
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-65536-woft
4B • Updated • 7