Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
igorktech
/
Skommarkhos-7b-ARPO-v5
like
0
Transformers
Safetensors
Generated from Trainer
trl
cpo
arxiv:
2401.08417
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Skommarkhos-7b-ARPO-v5
857 MB
1 contributor
History:
13 commits
igorktech
End of training
b53c76f
verified
7 months ago
.gitattributes
1.52 kB
initial commit
7 months ago
README.md
2.57 kB
End of training
7 months ago
adapter_config.json
885 Bytes
Training in progress, step 30
7 months ago
adapter_model.safetensors
852 MB
xet
End of training
7 months ago
chat_template.jinja
284 Bytes
Training in progress, step 30
7 months ago
generation_config.json
252 Bytes
Training in progress, step 30
7 months ago
special_tokens_map.json
317 Bytes
Training in progress, step 30
7 months ago
tokenizer.json
4.67 MB
Training in progress, step 30
7 months ago
tokenizer_config.json
1.91 kB
Training in progress, step 180
7 months ago
training_args.bin
6.23 kB
xet
Training in progress, step 30
7 months ago