floyed shen
floyed
AI & ML interests
None yet
Recent Activity
upvoted a paper about 6 hours ago
From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation
upvoted a paper about 6 hours ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
upvoted a paper 16 days ago
Safety Instincts: LLMs Learn to Trust Their Internal Compass for Self-Defense