Datasets and models in the paper "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models" [github.com/yuleiqin/RAIF].
Yulei Qin
yolay
AI & ML interests
Medical Imaging, Computer Vision, Language Models
Recent Activity
upvoted a paper 3 days ago
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization
upvoted a paper 11 days ago
Complex Logical Instruction Generation
upvoted a paper 11 days ago
OpenCUA: Open Foundations for Computer-Use Agents
Organizations
None yet