InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published 8 days ago • 33
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published 8 days ago • 33
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published 8 days ago • 33 • 2
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20 • 14
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21 • 110
Memory Collection Prompt is text-based memory. System II prompting is updating memory. Parametric memory is long-term, while prompt-based are short-tem. • 23 items • Updated Oct 22 • 2
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21 • 110
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21 • 110 • 3
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20 • 14
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20 • 14 • 2
When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation Paper • 2510.07238 • Published Oct 8 • 14
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses Paper • 2510.00232 • Published Sep 30 • 15
OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30 • 34
OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30 • 34