SDPO: Segment-Level Direct Preference Optimization for Social Agents • Paper • 2501.01821 • Published Jan 3, 2025
Enhancing Human-Like Responses in Large Language Models • Paper • 2501.05032 • Published Jan 9, 2025