LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 46
Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games Paper • 2506.03610 • Published Jun 4 • 9
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8 • 27
ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning Paper • 2508.10419 • Published Aug 14 • 73
TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling Paper • 2510.01698 • Published 16 days ago • 4
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 12 days ago • 401