HuggingFaceH4
/

mistral-7b-anthropic

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

lewtun HF Staff commited on Feb 1, 2024

Commit

5734eec

·

verified ·

1 Parent(s): 313cbf5

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 # Mistral 7B Constitutional AI
-This model is a DPO-aligned version of [HuggingFaceH4/mistral-7b-cai](https://huggingface.co/HuggingFaceH4/mistral-7b-cai) on the HuggingFaceH4/ultrafeedback_binarized_fixed and the HuggingFaceH4/cai-conversation-harmless datasets.
 It achieves the following results on the evaluation set:
 - Loss: 0.6327

 # Mistral 7B Constitutional AI
+This model is a DPO-aligned version of Mistral 7B on the HuggingFaceH4/ultrafeedback_binarized_fixed and the HuggingFaceH4/cai-conversation-harmless datasets.
 It achieves the following results on the evaluation set:
 - Loss: 0.6327