danielhanchen 
posted an update 20 days ago

Fits almost perfectly into an A6000!

·

Hopefully it runs fast for you! :)

I run it on a Threadripper 3970X with 256 GB of system RAM, offloading compute layers to a GTX 1660 with 6 GB VRAM. Using llama.cpp with -nkvo -kvu and all MoE layers on CPU. Amazing generation speed of 14 tok/s using q8_0. I'm amazed
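The setup described above (KV cache kept in system RAM, MoE expert tensors pinned to CPU, only the small dense layers on the 6 GB GPU) might look roughly like the following llama.cpp invocation. This is a hedged sketch, not the commenter's exact command: the model filename is a placeholder, and the `-ot` regex and flag availability depend on your llama.cpp build.

```shell
# Sketch of a llama.cpp run matching the commenter's description.
# -nkvo  (--no-kv-offload): keep the KV cache in system RAM, not VRAM
# -kvu   (--kv-unified):    use a unified KV cache buffer
# -ngl 99:                  offload as many dense layers as fit on the GPU
# -ot ".ffn_.*_exps.=CPU":  pin all MoE expert tensors to the CPU
# model-q8_0.gguf is a placeholder path for the q8_0 quant.
./llama-cli \
  -m model-q8_0.gguf \
  -nkvo \
  -kvu \
  -ngl 99 \
  -ot ".ffn_.*_exps.=CPU" \
  -p "Hello"
```

Keeping the experts on CPU is what makes this work on a 6 GB card: in a MoE model the expert weights dominate the parameter count but only a few experts are active per token, so CPU RAM bandwidth becomes the bottleneck rather than VRAM capacity.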

·

Awesome to hear, thanks for trying them out!

Awesome!

Too bad that I can't use it. With 8 GB VRAM and 24 GB DDR5 RAM, I might be able to run it at an unusable speed with Q1, but at that point I'm better off using GLM 4.7 Flash.

Excited for a Qwen 3.5 coding or general MoE model in the 30-40B range.