BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback Paper • 2509.21106 • Published 27 days ago • 7
MT-RAIG: Novel Benchmark and Evaluation Framework for Retrieval-Augmented Insight Generation over Multiple Tables Paper • 2502.11735 • Published Feb 17