ConicCat's picture
Update README.md
5523d7e verified
metadata
base_model:
  - swiss-ai/Apertus-8B-2509
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - apertus
  - trl
  - cpo
license: apache-2.0
language:
  - en

Actually functional this time Alpha/SimPO checkpoint trained from Apertus base.

Trained on a mix of curated C2, Gutenberg, and Instruct Skill-Mix

Alpaca chat template, temp .5, min_p .05, rep pen 1.05 seems reasonable.

Now I just have to make a better preference dataset...