view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events +5 Jul 17 • 49
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content Paper • 2406.11811 • Published Jun 17, 2024 • 16