pszemraj/mGPT-Peter-2E
Tags: Text Generation, Transformers, PyTorch, Safetensors, mc4, Wikipedia, gpt2, multilingual, gpt3, Deepspeed, Megatron, mGPT, text-generation-inference
License: apache-2.0
Branch: main
Repository size: 12.2 GB
Contributors (2): pszemraj, SFconvertbot
History: 1 commit
Latest commit: 7a50d00 (verified), "Super-squash branch 'main' using huggingface_hub", 2 days ago
Every file below was last changed by the commit "Super-squash branch 'main' using huggingface_hub", 2 days ago.

| File | Scan | Size | Storage | Detected pickle imports |
|---|---|---|---|---|
| .gitattributes | Safe | 1.23 kB | | |
| .gitignore | Safe | 13 Bytes | | |
| README.md | Safe | 4.41 kB | | |
| config.json | Safe | 725 Bytes | | |
| latest | Safe | 15 Bytes | | |
| merges.txt | Safe | 1.2 MB | | |
| model.safetensors | Safe | 6.07 GB | xet | |
| pytorch_model.bin | Safe, pickle | 6.07 GB | xet | (3) "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage" |
| rng_state_0.pth | pickle | 14.5 kB | xet | (7) "numpy.core.multiarray._reconstruct", "torch._utils._rebuild_tensor_v2", "torch.ByteStorage", "collections.OrderedDict", "numpy.dtype", "numpy.ndarray", "_codecs.encode" |
| special_tokens_map.json | Safe | 387 Bytes | | |
| tokenizer.json | Safe | 4.79 MB | | |
| tokenizer_config.json | Safe | 614 Bytes | | |
| trainer_state.json | Safe | 26 kB | | |
| training_args.bin | pickle | 4.27 kB | xet | (8) "transformers.trainer_utils.SchedulerType", "transformers.deepspeed.HfTrainerDeepSpeedConfig", "transformers.trainer_utils.HubStrategy", "transformers.training_args.TrainingArguments", "torch.float16", "torch.device", "transformers.training_args.OptimizerNames", "transformers.trainer_utils.IntervalStrategy" |
| vocab.json | Safe | 1.89 MB | | |
| zero_to_fp32.py | Safe | 18.9 kB | | |
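
The "Detected pickle imports" entries above name the Python globals that a pickle stream would import when loaded. As a rough illustration of how such a list can be produced without executing the pickle, the sketch below walks the opcode stream with the standard-library `pickletools` module. It is not the Hub's scanner: the function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a simple heuristic, and the input is assumed to be a raw pickle stream (a checkpoint saved as a zip archive would first need its embedded pickle extracted).

```python
import pickle
import pickletools
from collections import OrderedDict

# Opcodes that push a text string; STACK_GLOBAL reads its module and name from two of these.
_STRING_OPCODES = {"UNICODE", "BINUNICODE", "SHORT_BINUNICODE", "BINUNICODE8"}


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' for each global a pickle stream references.

    Hypothetical helper: it only disassembles the stream and never unpickles it.
    """
    imports = set()
    recent_strings = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in _STRING_OPCODES:
            recent_strings.append(arg)
        elif opcode.name == "GLOBAL":
            # Protocols below 4: the argument is "module name" in one space-separated string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            # Protocol 4+: heuristic, assumes module and name were the last two strings pushed.
            module, name = recent_strings[-2], recent_strings[-1]
            imports.add(f"{module}.{name}")
    return imports


# Small demonstration on in-memory pickles rather than on the repository's files.
legacy = pickle.dumps(OrderedDict(a=1), protocol=3)
modern = pickle.dumps(OrderedDict(a=1), protocol=4)
print(sorted(list_pickle_imports(legacy)))
print(sorted(list_pickle_imports(modern)))
```

Both calls should report `collections.OrderedDict`, the same kind of entry listed for `pytorch_model.bin` and `rng_state_0.pth`. Because the stream is only disassembled, nothing it references is imported or run during the check.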