What question did this study set out to answer?

The study aims to develop a framework to evaluate external memory systems for AI agents based on user-centric criteria.

February 5, 2026Open Access

An evaluation framework for AI agents external memory systems.

Puntos clave

The study aims to develop a framework to evaluate external memory systems for AI agents based on user-centric criteria.
Utilized a synthetic persona for evaluations.
Analyzed a dataset of 500 chatbot-style conversations.
Formulated 59 fact-based queries with objective acceptance criteria.
Compared factual recall across four different memory systems: mem0, ChatGPT Memory, Ontbo/Light, and Ontbo v2.
Ontbo v2 achieved 93.2% recall, outperforming other systems.
Stratified analysis showed degradation in performance with bounded or lossy memory strategies as conversation history increased.

Resumen

This paper introduces an evaluation framework for long-term, user-centric external memory systems for AI agents. Using a synthetic persona and a diverse corpus of 500 chatbot-style conversations, we derive 59 fact-based queries with objective acceptance criteria and measure factual recall across four systems: mem0, ChatGPT Memory, Ontbo/Light, and Ontbo v2. Results show that Ontbo v2 reaches 93.2% recall, outperforming the other approaches, and a stratified analysis highlights how bounded or lossy memory strategies degrade as conversation history grows. The paper details the dataset construction, querying protocol, and evaluation methodology to support reproducible, privacy-preserving benchmarking of agent memory and personalization.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Aubry et al. (Tue,) studied this question.

www.synapsesocial.com/papers/69843583f1d9ada3c1fb4550 — DOI: https://doi.org/10.5281/zenodo.18471369

Authors

Stéphane Aubry

Luca Pelissero-Witoslawski

Athénaïs Oslati

Actions

Institutions

Asociación Psicoanalítica de Buenos Aires

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

An evaluation framework for AI agents external memory systems.

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion