Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning • Paper • 2510.03259 • Published 27 days ago • 55
ReviewScore: Misinformed Peer Review Detection with Large Language Models • Paper • 2509.21679 • Published 28 days ago • 63
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models • Paper • 2505.17225 • Published May 22 • 64