HUME: Measuring the Human-Model Performance Gap in Text Embedding Task
Paper
• 2510.10062 • Published
• 10
The HUME benchmark is designed to evaluate the performance of text embedding models and humans on a comparable set of tasks. This captures areas wh...