Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2224.2
TFLOPS
21
16
30
Loser Cheems
JingzeShi
Follow
vasuji's profile picture
Alliance529's profile picture
kroeke's profile picture
51 followers
·
20 following
https://github.com/LoserCheems
LoserCheems
AI & ML interests
I like training small languge models.
Recent Activity
posted
an
update
about 1 month ago
Is it time to start developing sparse attention again? https://github.com/SmallDoges/flash-sparse-attention
upvoted
a
paper
about 1 month ago
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
upvoted
an
article
about 1 month ago
From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels
View all activity
Organizations
JingzeShi
's models
7
Sort: Recently updated
JingzeShi/OpenSeek-1.4B-A0.4B-KTO
Text Generation
•
1B
•
Updated
Sep 9
•
5
JingzeShi/OpenSeek-1.4B-A0.4B
Text Generation
•
1B
•
Updated
Aug 24
•
4
JingzeShi/Doge-20M
Text Generation
•
37.6M
•
Updated
Jul 5
•
17
JingzeShi/Doge-320M-Reason-checkpoint
0.4B
•
Updated
May 15
•
4
JingzeShi/Doge-320M-Reason-Distill
Text Generation
•
0.3B
•
Updated
Mar 29
•
5
JingzeShi/Doge-120M-MoE
0.1B
•
Updated
Mar 20
•
8
JingzeShi/Mixtral-7B-v0.1
Text Generation
•
7B
•
Updated
Mar 4
•
4