Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance.