When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper • 2510.04849 • Published 13 days ago • 97
OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features Paper • 2509.22033 • Published 23 days ago • 16
The Rogue Scalpel: Activation Steering Compromises LLM Safety Paper • 2509.22067 • Published 23 days ago • 26
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds Paper • 2508.12782 • Published Aug 18 • 25
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics Paper • 2406.14553 • Published Jun 20, 2024 • 2
ViSTa Dataset: Do vision-language models understand sequential tasks? Paper • 2411.13211 • Published Nov 20, 2024