Models: 33

Active filters: nebius

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 5.09M • 5.09k

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27 • 1.18M • 12k

moonshotai/Kimi-K2-Thinking

Text Generation • Updated 30 days ago • 395k • 1.5k

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 7.95M • 4.03k

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 4.51M • 4.22k

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated 4 days ago • 1.16M • 798

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 791k • 4.45k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 1.37M • 1.73k

Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7 • 804k • 473

Qwen/Qwen3-30B-A3B-Instruct-2507

Text Generation • 31B • Updated Sep 17 • 591k • 681

google/gemma-2-2b-it

Text Generation • 3B • Updated Aug 27, 2024 • 757k • 1.24k

meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 410k • 2.59k

Qwen/Qwen3-32B

Text Generation • 33B • Updated Jul 26 • 4.05M • 595

Qwen/Qwen3-Coder-480B-A35B-Instruct

Text Generation • 480B • Updated Aug 21 • 196k • 1.25k

Qwen/Qwen3-30B-A3B-Thinking-2507

Text Generation • 31B • Updated Aug 17 • 507k • 323

moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated about 1 month ago • 172k • 2.27k

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • 12B • Updated 13 days ago • 27.1k • 140

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 163k • 569

zai-org/GLM-4.5-Air

Text Generation • 110B • Updated Aug 11 • 563k • 531

Qwen/Qwen3-235B-A22B-Instruct-2507

Text Generation • 235B • Updated Sep 17 • 116k • 729

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 80.9k • 382

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11 • 22.2k • 1.39k

NousResearch/Hermes-4-70B

Text Generation • 71B • Updated Sep 2 • 1.98k • 160

google/gemma-2-9b-it

Text Generation • 9B • Updated Aug 27, 2024 • 149k • 747

Qwen/Qwen2.5-Coder-7B

Text Generation • 8B • Updated Nov 18, 2024 • 35.1k • 128

deepseek-ai/DeepSeek-V3-0324

Text Generation • 685B • Updated Mar 27 • 142k • 3.08k

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29 • 423k • 2.39k

NousResearch/Hermes-4-405B

Text Generation • 406B • Updated Sep 2 • 215 • 77

PrimeIntellect/INTELLECT-3-FP8

Text Generation • 107B • Updated 11 days ago • 2.14k • 18

intfloat/e5-mistral-7b-instruct

Feature Extraction • 7B • Updated Apr 23, 2024 • 152k • 552