Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published Mar 14 • 88
Efficient Agent Evaluation via Diversity-Guided User Simulation Paper • 2604.21480 • Published 11 days ago • 14
Efficient Agent Evaluation via Diversity-Guided User Simulation Paper • 2604.21480 • Published 11 days ago • 14
CLEAR: Error Analysis via LLM-as-a-Judge Made Easy Paper • 2507.18392 • Published Jul 24, 2025 • 20
Effective Red-Teaming of Policy-Adherent Agents Paper • 2506.09600 • Published Jun 11, 2025 • 39
Effective Red-Teaming of Policy-Adherent Agents Paper • 2506.09600 • Published Jun 11, 2025 • 39 • 2
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models Paper • 2505.19621 • Published May 26, 2025 • 4
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models Paper • 2505.19621 • Published May 26, 2025 • 4
Think Again! The Effect of Test-Time Compute on Preferences, Opinions, and Beliefs of Large Language Models Paper • 2505.19621 • Published May 26, 2025 • 4 • 2
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations Paper • 2505.18125 • Published May 23, 2025 • 112
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published Mar 25, 2025 • 76
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published Mar 25, 2025 • 76
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published Mar 25, 2025 • 76 • 2
Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In Paper • 2410.16950 • Published Oct 22, 2024 • 1