RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models University of Pennsylvania
BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks University of Pennsylvania