Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TitleOS 's Collections
RLAIF Experimentation
Qwen3 Coder Heretic - Decensored
Spark 270M - Micro Local Utility LLM
Lightning 1.7B - Local Utility LLM
HomePhi4 - Home Assistant Reasoning LLM
HomeGem - Home Assistant Conversational LLM
Galactic Reasoning LoRA Adapters
Experiments

RLAIF Experimentation

updated 15 days ago

Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance.

Upvote
-

  • TitleOS/rlaif_training_fictional_patriot_experiment

    Viewer • Updated 16 days ago • 255 • 37

  • TitleOS/RLAIF_Patriot_Experiment_LoRA

    Updated 16 days ago • 22

  • TitleOS/RLAIF_Patriot_Experiment_Q8_0-GGUF

    38.4M • Updated 15 days ago • 9

  • TitleOS/RLAIF_Patriot_Experiment_F16-GGUF

    38.4M • Updated 15 days ago • 15
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs