Spaces:

CatoG
/

DPO_Demo

Sleeping

App Files Files Community

DPO_Demo / README.md

CatoG

Update README with detailed application description

81a070f unverified 15 days ago

preview code

raw

history blame

638 Bytes

metadata

title: DPO Demo
emoji: 📚
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 6.0.2
app_file: app.py
pinned: false
short_description: Testing DPO for finetuning models

A test / demo application playground for DPO Preference Tuning on different LLM models. Running on Huggingspace: https://huggingface.co/spaces/CatoG/DPO_Demo

Allows for LLM model selection, preference tuning of LLM responses, model response tuning with LoRA and Direct Preference Optimization (DPO). Tuned model / policies can be downloaded for further use.

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference