What question did this study set out to answer?

The study aims to enhance the efficiency of computer use agents by integrating a directed graph-based persistent memory system.

June 4, 2026Open Access

Graph-Structured Persistent Memory for Efficient LLM-Based Computer Use Agents

Puntos clave

The study aims to enhance the efficiency of computer use agents by integrating a directed graph-based persistent memory system.
Developed a memory-augmented agent formalized as S=⟨A,Σ,G,δ,π,Φ⟩.
Defined task reachability and memory-coverage conditions using functional stability theory.
Conducted experiments on OSWorld to compare the proposed agent against a memoryless baseline.
The memory-augmented agent reduced LLM token consumption by approximately 50% compared to the baseline.
Execution time was also cut by about 50% while maintaining comparable success rates (≈36.9% on 15-step tasks and ≈46.9% on 50-step tasks).
The contribution emphasizes operational efficiency through the use of reusable graph memory.

Resumen

Large language model (LLM)-driven computer use agents (CUAs) automate graphical user interface (GUI) tasks but often re-solve previously encountered subtasks, increasing token use and latency. We address this limitation with a directed graph-based persistent memory in which nodes represent observable GUI states and edges encode executable action sequences. We formalize the memory-augmented agent as S=⟨A,Σ,G,δ,π,Φ⟩, define task reachability and memory-coverage conditions inspired by functional stability theory, and derive token-cost efficiency bounds. In control-theoretic terms, the Manager–Worker architecture can be interpreted as a closed-loop system where memory provides experience-based feedback; this interpretation is used as an analogy rather than a full model-reference adaptive control proof. Experiments on OSWorld show that the proposed agent cuts both the LLM token consumption and execution time by about 50% versus a memoryless baseline while preserving comparable success rates (≈36.9% on 15-step and ≈46.9% on 50-step tasks). The demonstrated contribution is therefore operational efficiency through reusable graph memory, not a claim of improved task success or classical Lyapunov stability.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo