Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Monthan Zikes's picture
6

Monthan Zikes

Zieksy

AI & ML interests

None yet

Organizations

None yet

New activity in bigcode/the-stack-v2-train-full-ids 5 months ago

Total Token Count & Total Size (GB/TB) in Full Files of bigcode/the-stack-v2-train-full-ids Dataset

#12 opened 5 months ago by
Zieksy
New activity in bigcode/the-stack-v2-train-smol-ids 5 months ago

Total Token Count & Total Size (GB/TB) in Full Files of bigcode/the-stack-v2-train-smol-ids Dataset

#9 opened 5 months ago by
Zieksy
New activity in nvidia/nemo-megatron-gpt-5B 5 months ago

How Does NeMo Handle Sequences That Exceed the Max Sequence Length?

#6 opened 5 months ago by
Zieksy
commented a paper 5 months ago

Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset

Paper • 2412.02595 • Published Dec 3, 2024 • 5 •
21
New activity in nvidia/Nemotron-H-8B-Base-8K 6 months ago

What’s the Pre-training Data Strategy Behind Nemotron-H?

#3 opened 6 months ago by
Zieksy
New activity in microsoft/Phi-3-mini-4k-instruct 6 months ago

Inquiry About Phi-3 Pre-Training Dataset Composition

1
#103 opened 6 months ago by
Zieksy
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs