FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving • arXiv:2501.01005 • Published Jan 2
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats • arXiv:2510.25602 • Published Oct 29
SINQ Collection • Models quantized with the SINQ quantization method • 19 items
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights • arXiv:2509.22944 • Published Sep 26
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm • arXiv:2507.18553 • Published Jul 24