HuggingFaceH4/ultrachat_200k
Viewer • Updated • 515k • 71.8k • 707
How to use dball/zephyr-tiny-sft-qlora-quantized-2 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")
model = PeftModel.from_pretrained(base_model, "dball/zephyr-tiny-sft-qlora-quantized-2")This model is a fine-tuned version of TinyLlama/TinyLlama-1.1B-Chat-v1.0 on the HuggingFaceH4/ultrachat_200k dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.1365 | 1.0 | 18257 | 1.1440 |
Base model
TinyLlama/TinyLlama-1.1B-Chat-v1.0