Memory Collection A prompt is text-based memory. System II prompting updates memory. Parametric memory is long-term, while prompt-based memory is short-term. • 23 items • Updated Oct 22 • 2
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21 • 109
Executable Knowledge Graphs for Replicating AI Research Paper • 2510.17795 • Published Oct 20 • 14
When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation Paper • 2510.07238 • Published Oct 8 • 14
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses Paper • 2510.00232 • Published Sep 30 • 15
OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published Sep 30 • 34
Towards Personalized Deep Research: Benchmarks and Evaluations Paper • 2509.25106 • Published Sep 29 • 29
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published Aug 11 • 51
Automating Steering for Safe Multimodal Large Language Models Paper • 2507.13255 • Published Jul 17 • 3
Article From Knowledge Updating to Behavior Control: A Large-Model Knowledge Editing Framework Based on EasyEdit Jul 15 • 5
Article Take Control of What Your LLM Knows and Does — with the EasyEdit Tool Series Jul 15 • 6
ReCode: Updating Code API Knowledge with Reinforcement Learning Paper • 2506.20495 • Published Jun 25 • 9
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs Paper • 2506.19290 • Published Jun 24 • 52
Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study Paper • 2506.19794 • Published Jun 24 • 8
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality Paper • 2506.19807 • Published Jun 24 • 7