Active filters: AWQ
QuantTrio/gemma-4-31B-it-AWQ-6Bit
Image-Text-to-Text
• 31B • Updated • 6.91k
• 6
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
• 31B • Updated • 104k
• 11
QuantTrio/Qwopus3.5-27B-v3-AWQ
Image-Text-to-Text
• 27B • Updated • 13.3k
• 5
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 371k
• 35
Image-Text-to-Text
• 10B • Updated • 341k
• 12
QuantTrio/gemma-4-31B-it-AWQ
Image-Text-to-Text
• 31B • Updated • 21.6k
• 3
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 51.8k
• 14
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated • 56.9k
• 25
solidrust/Mistral-7B-Instruct-v0.3-AWQ
Text Generation
• 7B • Updated • 3.32k
• 9
pomelk1n/RuadaptQwen2.5-32B-instruct-4-bit-AWQ-Marlin
Text Generation
• 33B • Updated • 6
• 1
pomelk1n/RuadaptQwen2.5-32B-instruct-4-bit-AWQ-GEMM
Text Generation
• 33B • Updated • 6
• 2
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 156k
• 16
QuantTrio/Qwen3.5-397B-A17B-AWQ
Image-Text-to-Text
• Updated • 20.1k
• 9
Text Generation
• 586B • Updated • 4.95k
• 5
Image-Text-to-Text
• 5B • Updated • 38.5k
• 7
QuantTrio/Qwopus3.5-27B-v3-AWQ-6Bit
Image-Text-to-Text
• 27B • Updated • 980
• 1
abhinavkulkarni/mosaicml-mpt-7b-instruct-w4-g128-awq
Text Generation
• Updated • 8
abhinavkulkarni/mosaicml-mpt-7b-chat-w4-g128-awq
Text Generation
• 1B • Updated • 25
abhinavkulkarni/VMware-open-llama-7b-open-instruct-w4-g128-awq
Text Generation
• Updated • 5
abhinavkulkarni/VMware-open-llama-13b-open-instruct-w4-g128-awq
Text Generation
• Updated • 3
• 3
abhinavkulkarni/tiiuae-falcon-7b-instruct-w4-g64-awq
Text Generation
• Updated • 4
• 5
abhinavkulkarni/psmathur-orca_mini_v2_7b-w4-g128-awq
Text Generation
• Updated • 10
• 2
abhinavkulkarni/Salesforce-codegen25-7b-multi-w4-g128-awq
Text Generation
• Updated • 19
• 2
abhinavkulkarni/psmathur-orca_mini_v2_13b-w4-g128-awq
Text Generation
• Updated • 5
• 2
abhinavkulkarni/mosaicml-mpt-30b-instruct-w4-g128-awq
Text Generation
• Updated • 13
• 2
abhinavkulkarni/mosaicml-mpt-30b-chat-w4-g128-awq
Text Generation
• 4B • Updated • 7
abhinavkulkarni/VMware-open-llama-7b-v2-open-instruct-w4-g128-awq
Text Generation
• Updated • 2
abhinavkulkarni/tiiuae-falcon-40b-instruct-w4-g128-awq
Text Generation
• Updated • 8
• 2
abhinavkulkarni/Salesforce-codegen25-7b-instruct-w4-g128-awq
Text Generation
• Updated • 19
• 3
abhinavkulkarni/meta-llama-Llama-2-7b-chat-hf-w4-g128-awq
Text Generation
• Updated • 10
• 6