-
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge
Paper • 2507.21183 • Published • 14 -
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
Paper • 2507.21802 • Published • 13 -
EDGE-GRPO: Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
Paper • 2507.21848 • Published • 8 -
Agentic Reinforced Policy Optimization
Paper • 2507.19849 • Published • 150
Oğuzhan Ercan
oguzhanercan
AI & ML interests
deep representation learning
Recent Activity
updated
a collection
about 16 hours ago
Voice
updated
a collection
about 16 hours ago
Finetuning Strategies
updated
a collection
about 16 hours ago
MultiModal Reasoning
Organizations
None yet