Listener-Rewarded Thinking in VLMs for Image Preferences Paper • 2506.22832 • Published Jun 28 • 23 • 1
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5 • 131
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long Contexts Paper • 2506.05229 • Published Jun 5 • 38
CLEAR: Character Unlearning in Textual and Visual Modalities Paper • 2410.18057 • Published Oct 23, 2024 • 209
Aligning Diffusion Models with Noise-Conditioned Perception Paper • 2406.17636 • Published Jun 25, 2024 • 27