GRACE: Generative Representation Learning via Contrastive Policy Optimization • Paper • 2510.04506 • Published 21 days ago • 10
s3: You Don't Need That Much Data to Train a Search Agent via RL • Paper • 2505.14146 • Published May 20 • 18
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning • Paper • 2503.00223 • Published Feb 28 • 1
RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation • Paper • 2502.10996 • Published Feb 16 • 1