Measuring what Matters: Construct Validity in Large Language Model Benchmarks Paper • 2511.04703 • Published Nov 3, 2025 • 8
Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models Paper • 2601.15220 • Published Jan 21 • 9
A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks Paper • 2212.00720 • Published Nov 16, 2022