Active filters: FP8
duydq12/GLM-Z1-32B-0414-FP8-dynamic
Text Generation
• 33B • Updated
• 4
duydq12/nomic-embed-code-FP8-dynamic
Text Generation
• 8B • Updated
• 243
• 1
duydq12/Qwen2.5-Coder-1.5B-Instruct-FP8-dynamic
Text Generation
• 2B • Updated
• 24
duydq12/Qwen2.5-Coder-3B-Instruct-FP8-dynamic
Text Generation
• 3B • Updated
• 2
nvidia/Qwen3-235B-A22B-FP8
Text Generation
• 235B • Updated
• 817
• 3
Image-Text-to-Text
• 109B • Updated
• 1
EliovpAI/Qwen3-14B-FP8-KV
Text Generation
• 15B • Updated
• 4
• 2
clarifai/Qwen3-Coder-30B-A3B-Instruct-FP8-Dynamic
Text Generation
• 31B • Updated
• 16
• 4
EliovpAI/Qwen3-0.6B-FP8-KV
Text Generation
• 0.6B • Updated
• 3
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated
• 22
• 4
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated
• 15.1k
• 4
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated
• 518
• 3
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated
• 2
Text Generation
• 15B • Updated
• 3.83k
• 4
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated
• 372
• 7
QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation
• Updated
• 109
QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation
• 236B • Updated
• 30
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8
Image-Text-to-Text
• 13B • Updated
• 11.7k
• 47
tokenlabsdotrun/Llama-3.1-8B-ModelOpt-FP8-QAT