Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
10
16
5
Deqing Fu
PRO
deqing
Follow
adamm-hf's profile picture
Mi6paulino's profile picture
tahamajs's profile picture
12 followers
·
17 following
https://deqingfu.github.io
DeqingFu
DeqingFu
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 hours ago
deqing/llama-300M-v5-window_8
updated
a model
about 2 hours ago
deqing/llama-300M-v5-window_2
published
a model
about 2 hours ago
deqing/llama-300M-v5-window_8
View all activity
Organizations
deqing
's models
93
Sort: Recently updated
deqing/llama-300M-v5-window_2
0.3B
•
Updated
9 minutes ago
deqing/llama-300M-v5-window_8
0.3B
•
Updated
34 minutes ago
deqing/lstm-12layer-v5
0.2B
•
Updated
36 minutes ago
deqing/llama-300M-v5-swap_numbers
Text Generation
•
0.3B
•
Updated
about 1 hour ago
•
1.48k
deqing/llama-300M-v5-window_4
Text Generation
•
0.3B
•
Updated
about 1 hour ago
•
2.18k
deqing/llama-300M-v5-isolate
Text Generation
•
0.3B
•
Updated
about 3 hours ago
•
3.86k
deqing/llama-300M-v5-original
Text Generation
•
0.3B
•
Updated
about 3 hours ago
•
2.71k
deqing/fone-llama-3.2-1B-fineweb-sample-100BT-fone3d-hybrid-tile-v4
Updated
about 7 hours ago
•
642
deqing/mamba2-300M-v5-mamba2
Text Generation
•
0.3B
•
Updated
about 8 hours ago
•
884
deqing/llama-300M-v5-unk_number
Text Generation
•
0.3B
•
Updated
1 day ago
•
1.9k
deqing/gdn-300M-v5-gdn
Text Generation
•
0.3B
•
Updated
1 day ago
•
877
deqing/llama-300M-v5-addition_3digit_adamw
0.3B
•
Updated
3 days ago
•
1.26k
deqing/llama-300M-v5-addition_3digit
0.3B
•
Updated
3 days ago
•
1.72k
deqing/llama-300M-v5-addition
Text Generation
•
0.3B
•
Updated
3 days ago
•
3.93k
deqing/llama-300M-v5-addition_adamw
Text Generation
•
0.3B
•
Updated
3 days ago
•
4.14k
deqing/llama-300M-v5-addition_adamw-old
0.3B
•
Updated
6 days ago
•
349
deqing/llama-300M-v5-addition_3digit-old
0.3B
•
Updated
6 days ago
deqing/llama-300M-v5-adamw-addition_3digit_adamw-old
0.3B
•
Updated
6 days ago
deqing/llama-300M-v5-original-random_init_sft
Updated
6 days ago
•
1
deqing/llama-300M-v5-isolate_sft
Updated
7 days ago
•
1
deqing/llama-300M-v5-swap_numbers_sft
Updated
7 days ago
deqing/llama-300M-v5-addition-old
0.3B
•
Updated
7 days ago
•
1.59k
deqing/llama-300M-v5-original_sft
Updated
7 days ago
•
5
deqing/llama-300M-v5-unigram
Text Generation
•
0.3B
•
Updated
7 days ago
•
1.63k
deqing/llama-300M-v5-bigram
Text Generation
•
0.3B
•
Updated
7 days ago
•
1.63k
deqing/lstm-window-4-v5
Text Generation
•
0.2B
•
Updated
8 days ago
•
1.7k
deqing/llama-300M-v5-fivegram
Text Generation
•
0.3B
•
Updated
9 days ago
•
1.77k
deqing/llama-300M-v5-base_7
Text Generation
•
0.3B
•
Updated
10 days ago
•
2.02k
deqing/llama-300M-v5-permute
Text Generation
•
0.3B
•
Updated
10 days ago
•
1.64k
deqing/llama-300M-v5-isolate-old
Text Generation
•
0.3B
•
Updated
11 days ago
•
1.97k
Previous
1
2
3
4
Next