Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
janhq
/
250404-llama-3.2-3b-instruct-grpo-02
like
0
Follow
Jan
740
TensorBoard
Safetensors
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
250404-llama-3.2-3b-instruct-grpo-02
/
checkpoint-50
1.19 GB
1 contributor
History:
1 commit
thinhlpg
Upload folder using huggingface_hub
9135103
verified
9 months ago
README.md
5.12 kB
Upload folder using huggingface_hub
9 months ago
adapter_config.json
878 Bytes
Upload folder using huggingface_hub
9 months ago
adapter_model.safetensors
778 MB
xet
Upload folder using huggingface_hub
9 months ago
optimizer.pt
396 MB
xet
Upload folder using huggingface_hub
9 months ago
rng_state.pth
14.2 kB
xet
Upload folder using huggingface_hub
9 months ago
scheduler.pt
1.06 kB
xet
Upload folder using huggingface_hub
9 months ago
special_tokens_map.json
454 Bytes
Upload folder using huggingface_hub
9 months ago
tokenizer.json
17.2 MB
xet
Upload folder using huggingface_hub
9 months ago
tokenizer_config.json
54.7 kB
Upload folder using huggingface_hub
9 months ago
trainer_state.json
31.8 kB
Upload folder using huggingface_hub
9 months ago
training_args.bin
6.07 kB
xet
Upload folder using huggingface_hub
9 months ago