John Leimgruber III
PRO
ubergarm
374 followers · 62 following
https://blog.aifoundry.org/p/adventures-in-model-quantization
ubergarm
john-leimgruber
AI & ML interests
Open LLMs and Astrophotography image processing.
ubergarm's activity
New activity in ubergarm/GLM-4.7-GGUF • about 17 hours ago
Stable run on 2x RTX 5090 and 2 Xeon E5 2696 V4 and DDR4 with ik_llama.cpp - 6.1 t/s on IQ4_K and 5.1 t/s on IQ5_K, opencode works with this
👍 1 • 20
#5 opened 3 months ago by martossien
updated a model • about 17 hours ago
ubergarm/Qwen3.5-397B-A17B-GGUF
Text Generation • 396B • Updated about 17 hours ago • 5.12k • 33
New activity in tarruda/Qwen3.5-397B-A17B-GGUF • about 17 hours ago
Great job on this one!
1
#1 opened about 17 hours ago by ubergarm
liked 2 models • about 17 hours ago
tarruda/Qwen3.5-397B-A17B-GGUF
Text Generation • 396B • Updated 3 days ago • 536 • 1
eousphoros/kappa-20b-131k-GGUF
Text Generation • 21B • Updated Mar 1 • 233 • 6
liked a model • 2 days ago
bartowski/google_gemma-3-4b-it-GGUF
Image-Text-to-Text • 4B • Updated Mar 22, 2025 • 28.1k • 34
New activity in ubergarm/GLM-5-GGUF • 6 days ago
Unreleased
1
#6 opened 8 days ago by jpsequeira
New activity in ubergarm/Qwen3-Coder-Next-GGUF • 6 days ago
Improving Qwen3 Coder Next 80b performance on ik_llama vs llama.cpp
👍 👀 2 • 16
#6 opened 27 days ago by sabotage3d
New activity in sokann/Qwen3.5-27B-GGUF-4.915bpw • 9 days ago
Nice work thanks for more ik_llama.cpp quants!
4
#1 opened 17 days ago by ubergarm
liked a model • 10 days ago
rodrigomt/s2-pro-gguf
Text-to-Speech • 5B • Updated 11 days ago • 5.43k • 27
New activity in rodrigomt/s2-pro-gguf • 10 days ago
I created an API server version of s2.cpp
👍 1 • 2
#4 opened 11 days ago by mach9243
New activity in AesSedai/Qwen3.5-397B-A17B-GGUF • 10 days ago
IQ2_XS?
🔥 2 • 30
#6 opened 20 days ago by tarruda
New activity in ubergarm/Qwen3.5-27B-GGUF • 10 days ago
Insight into the "weird" data.
130
#3 opened about 1 month ago by espen96
New activity in ubergarm/Qwen3.5-397B-A17B-GGUF • 10 days ago
Qwen3.5-397B-A17B-IQ4_KSS on 8 RTX 3090 context 161K tokens load by ik_llama.cpp , test with opencode
🔥 1 • 2
#12 opened 11 days ago by martossien
New activity in tarruda/Qwen3.5-397B-A17B-heretic-smol-IQ2_XS-GGUF • 10 days ago
Any chance of IQ2_XXS? IQ2_XS is just slightly too big for Strix Halo.
6
#2 opened 11 days ago by Cortex0833
New activity in ubergarm/MiniMax-M2.5-GGUF • 10 days ago
ik_llama.cpp version
13
#11 opened about 2 months ago by geveent
New activity in ubergarm/Qwen3.5-27B-GGUF • 10 days ago
Appraisal
🔥 1 • 5
#6 opened 16 days ago by wonderfuldestruction
New activity in AesSedai/Mistral-Small-4-119B-2603-GGUF • 11 days ago
Mistral-Small-4-119B-2603-Q5_K_M on 8 RTX 3090 with ik_llama.cpp ( compil 21 march 2026 )
❤️ 🔥 3 • 7
#1 opened 12 days ago by martossien
liked a model • 11 days ago
fishaudio/s2-pro
Text-to-Speech • 5B • Updated 22 days ago • 25k • 798
New activity in ubergarm/Qwen3.5-122B-A10B-GGUF • 12 days ago
How to split this model between 2 (3) GPUs and CPU/RAM ?
18
#12 opened 15 days ago by mancub