SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published Jan 3, 2025
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs Paper • 2407.08995 • Published Jul 12, 2024