HuggingFaceH4/mistral-7b-anthropic

lewtun (HF Staff) committed on Feb 1, 2024

Commit 313cbf5 · verified · 1 Parent(s): 728c8ea

Update README.md

Files changed (1)

README.md +3 -2

@@ -15,9 +15,10 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# mistral-7b-dpo-v21.0cai.0.2
-This model is a fine-tuned version of [HuggingFaceH4/mistral-7b-cai](https://huggingface.co/HuggingFaceH4/mistral-7b-cai) on the HuggingFaceH4/ultrafeedback_binarized_fixed and the HuggingFaceH4/cai-conversation-harmless datasets.
+# Mistral 7B Constitutional AI
+This model is a DPO-aligned version of [HuggingFaceH4/mistral-7b-cai](https://huggingface.co/HuggingFaceH4/mistral-7b-cai) on the HuggingFaceH4/ultrafeedback_binarized_fixed and the HuggingFaceH4/cai-conversation-harmless datasets.
 It achieves the following results on the evaluation set:
 - Loss: 0.6327
 - Rewards/chosen: -9.8716