YijuGuo
AI & ML interests
LLM Alignment
Recent Activity
authored
a paper
7 days ago
Controllable Preference Optimization: Toward Controllable
Multi-Objective Alignment
authored
a paper
7 days ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding