What question did this study set out to answer?

The research aims to develop a memory consolidation system for large language models that enhances knowledge retention and retrieval.

April 21, 2026 | Open Access

SAE Feature-Triggered Subspace Weight Consolidation for Large Language Models with Full Memory Lifecycle Dynamics

Key Points

The research aims to develop a memory consolidation system for large language models that enhances knowledge retention and retrieval.
Implemented a memory consolidation system with SAE-triggered retrieval and structured forgetting.
Evaluated on 1,000 CounterFact cases to assess efficacy, generality, and specificity.
Used constrained least-squares updates and orthogonalization techniques for weight consolidation.
Achieved 78.0% efficacy, 77.6% generality, and 100% specificity in memory recall.
Demonstrated a 29× smaller weight perturbation compared to standard fine-tuning techniques.
Confirmed localized rank reduction effects and the effectiveness of Ebbinghaus-style forgetting dynamics.
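
The forgetting dynamics named in the last key point can be illustrated with a minimal sketch. It assumes the classic exponential form of the Ebbinghaus curve, s(t) = s0 * exp(-t / S). The page does not give the decay law, the stability values, or the pruning threshold, so the function names, the hour units, and every number below are hypothetical.

```python
import math

def decay_strength(initial_strength, elapsed_hours, stability_hours):
    """Exponential (Ebbinghaus-style) decay: s(t) = s0 * exp(-t / S).

    The exponential form and the parameter names are assumptions,
    not details taken from the paper.
    """
    return initial_strength * math.exp(-elapsed_hours / stability_hours)

def forget(memories, now_hours, threshold=0.1):
    """Keep only memories whose decayed strength is still >= threshold.

    `memories` maps a fact id to (strength, last_access_hours, stability_hours).
    The threshold value is illustrative.
    """
    kept = {}
    for fact, (strength, last_access, stability) in memories.items():
        current = decay_strength(strength, now_hours - last_access, stability)
        if current >= threshold:
            kept[fact] = current
    return kept

# Two hypothetical facts stored at t = 0 with different stabilities.
memories = {
    "fact_a": (1.0, 0.0, 24.0),  # slow decay
    "fact_b": (1.0, 0.0, 2.0),   # fast decay
}
surviving = forget(memories, now_hours=12.0)
```

After 12 hours only the slowly decaying fact stays above the threshold, which is the qualitative behavior a time-based strength decay is meant to produce.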

Abstract

We present a memory consolidation system for large language models (LLMs) implementing a full knowledge lifecycle: continuous fact extraction from dialogue, context-sensitive SAE-triggered retrieval, offline subspace weight consolidation, and structured forgetting via time-based strength decay. The system uses GemmaScope-2 sparse autoencoder (SAE) features at layer 16 of Gemma-3-4B as memory retrieval triggers, and performs constrained least-squares updates to MLP down-projection weights at layers 25–27 with Gram-Schmidt orthogonalization to protect consolidated knowledge. Evaluated on 1,000 CounterFact cases, the method achieves 78.0% efficacy, 77.6% generality, and 100% specificity, with a 29× smaller weight perturbation than standard fine-tuning (ΔW = 0.004 vs. 0.118). Mechanistic analysis confirms that rank reduction is localized to layers 25–27, with a 3.4× edit-specificity ratio. The system also reproduces Ebbinghaus-style forgetting dynamics and demonstrates parametric implicit memory persistence after index clearance. This is a preliminary preprint (v1.0, April 2026). Code and updated versions will be released in future revisions.
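
To make the idea of a constrained least-squares update with Gram-Schmidt protection concrete, here is a small NumPy sketch. It is not the authors' implementation. It assumes a single weight matrix W, one new key/target pair, and a set of protected keys standing in for previously consolidated knowledge. It returns the minimum-Frobenius-norm ΔW that maps the new key to its target while leaving the output for every protected key unchanged. All names and shapes are hypothetical.

```python
import numpy as np

def gram_schmidt(vectors, tol=1e-10):
    """Orthonormalize the rows of `vectors`, dropping near-dependent rows."""
    basis = []
    for v in vectors:
        w = np.asarray(v, dtype=float).copy()
        for q in basis:
            w -= np.dot(q, w) * q  # modified Gram-Schmidt step
        norm = np.linalg.norm(w)
        if norm > tol:
            basis.append(w / norm)
    return np.array(basis)

def protected_update(W, key, target, protected_keys):
    """Minimum-norm dW with (W + dW) @ key == target and dW @ p == 0
    for every protected key p.

    Hypothetical helper: shapes are W (d_out, d_in), key (d_in,),
    target (d_out,), protected_keys (n, d_in).
    """
    Q = gram_schmidt(protected_keys)
    # Remove the component of the new key that lies in the protected subspace.
    k_perp = np.asarray(key, dtype=float).copy()
    for q in Q:
        k_perp -= np.dot(q, k_perp) * q
    denom = np.dot(k_perp, k_perp)
    if denom < 1e-12:
        raise ValueError("key lies inside the protected subspace")
    residual = target - W @ key
    # Rank-one update along the orthogonalized key direction.
    return np.outer(residual, k_perp) / denom

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 6))
key = rng.normal(size=6)
target = rng.normal(size=4)
protected = rng.normal(size=(2, 6))

delta = protected_update(W, key, target, protected)
```

Because the update acts only along the part of the new key that is orthogonal to the protected keys, earlier edits are untouched by construction. Keeping the update inside a small subspace in this way is also the kind of restriction that tends to yield a small weight perturbation.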

Authors

H Zhang

Institutions

Kyushu University
