DeepSeek
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference
DeepSeek-OCR 2: Visual Causal Flow
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • 685B • Updated • 204k • • 994 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 891 • 67 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 5.27M • • 1.43k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • Updated • 10.8k • • 703
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 3.83M • • 13.3k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.19k • 957 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 126k • • 773 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 859k • • 1.56k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 626 • 694 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 39.1k • 151 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 4.01k • 94 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.65k • 89
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 4.33k • 178 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 8.39k • 462 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 4.3k • 334 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 509k • 175
models for paper expert-specialized fine-tuning
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 5.9k • 567 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 191k • 493 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 185k • 151 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 38.6k • 163
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 118k • 686 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 848 • 81 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 2.67k • 111 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 1.2M • • 597
DeepSeek-VL model series
DeepSeek LLM series
DeepSeek MoE series
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • 685B • Updated • 204k • • 994 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 891 • 67 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 5.27M • • 1.43k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • Updated • 10.8k • • 703
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 3.83M • • 13.3k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 5.19k • 957 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 126k • • 773 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 859k • • 1.56k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 626 • 694 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 39.1k • 151 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 4.01k • 94 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.65k • 89
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation • 236B • Updated • 4.33k • 178 -
deepseek-ai/DeepSeek-V2-Chat
Text Generation • 236B • Updated • 8.39k • 462 -
deepseek-ai/DeepSeek-V2
Text Generation • 236B • Updated • 4.3k • 334 -
deepseek-ai/DeepSeek-V2-Lite
Text Generation • 16B • Updated • 509k • 175
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • 236B • Updated • 118k • 686 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 848 • 81 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 2.67k • 111 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • 16B • Updated • 1.2M • • 597
models for paper expert-specialized fine-tuning
DeepSeek-VL model series
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 5.9k • 567 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 191k • 493 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 185k • 151 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 38.6k • 163
DeepSeek LLM series
DeepSeek MoE series