tencent/DeepSeek-V3.1-Terminus-W4AFP8
Tags: Text Generation, Transformers, Safetensors, deepseek_v3, quantized, TensorRT-Model-Optimizer, int4, fp8, conversational, custom_code, text-generation-inference, 8-bit precision
License: mit
DeepSeek-V3.1-Terminus-W4AFP8/hf_quant_config.json
Xijun Chen, initial commit (9e6eee3), 2 months ago
108 Bytes
{
    "quantization": {
        "quant_algo": "MIXED_PRECISION",
        "kv_cache_quant_algo": null
    }
}
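
As a minimal sketch of how a loader might read this file, the Python snippet below parses the JSON with the standard library and extracts the two fields. The key names come from the file itself. The helper name `summarize_quant_config` and the inlined copy of the file contents are assumptions made for illustration; this is not the loading code of any particular inference framework.

```python
import json

# Inline copy of hf_quant_config.json, so the sketch runs without
# downloading the repository (assumption: contents match the file above).
HF_QUANT_CONFIG = """
{
    "quantization": {
        "quant_algo": "MIXED_PRECISION",
        "kv_cache_quant_algo": null
    }
}
"""


def summarize_quant_config(text):
    """Hypothetical helper: return (quant_algo, kv_cache_quant_algo).

    Missing keys yield None rather than raising, so the helper also
    tolerates configs that omit the "quantization" section.
    """
    cfg = json.loads(text)
    quant = cfg.get("quantization", {})
    return quant.get("quant_algo"), quant.get("kv_cache_quant_algo")


weights_algo, kv_algo = summarize_quant_config(HF_QUANT_CONFIG)
print("quant_algo:", weights_algo)
print("kv_cache_quant_algo:", kv_algo)
```

JSON `null` is parsed as Python `None`, so a caller can test `kv_algo is None` to detect that no KV-cache quantization algorithm is named in this file. What a given runtime does in that case is not stated here.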