What question did this study set out to answer?

This research examines how alignment affects memory consolidation in language models during continual learning.

February 28, 2026 · Open Access

The Alignment Tax on Continual Learning: Inverse Scaling of Memory Consolidation in Language Models

Key Points

Evaluated memory consolidation at three language model sizes (3B, 8B, and 70B parameters).
Assessed factual recall percentages and confabulation incidents after training.
Investigated the impact of RLHF alignment and behavioral priors on knowledge retention.
Factual recall falls from 47% at 3B parameters to zero at 70B, despite successful training (low loss, correct gradient flow).
At 8B, recall is 37%, with significant confabulation.
The study identifies RLHF alignment training as the cause: it interferes with LoRA-based knowledge injection, and the effect grows with model size.

Abstract

We report a surprising inverse scaling phenomenon in LoRA-based memory consolidation for language models. At 3B parameters, sleep-wake consolidation achieves 47% factual recall after training. At 8B, recall drops to 37% with significant confabulation. At 70B, recall is zero despite successful training (low loss, correct gradient flow). We identify RLHF alignment as the cause: safety training creates a behavioral prior that overrides LoRA-injected knowledge at inference time. The effect scales with model size because larger models receive more extensive alignment training. This 'alignment tax' on continual learning has implications for any system attempting to inject new knowledge into aligned language models via parameter-efficient fine-tuning.
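The abstract reports factual recall as a percentage but does not say how an answer is scored. As a rough illustration only, the sketch below counts an answer as recalled when it contains the expected fact as a case-insensitive substring. The scoring rule, the function name `recall_rate`, and the sample prompts and facts are all assumptions made for this example, not the study's evaluation procedure.

```python
def recall_rate(answers, expected):
    """Return the fraction of answers that contain their expected fact.

    Assumed scoring rule (not taken from the study): an answer counts as
    recalled when the expected fact appears in it as a case-insensitive
    substring.
    """
    if len(answers) != len(expected):
        raise ValueError("answers and expected must have the same length")
    if not expected:
        return 0.0
    hits = sum(
        1
        for answer, fact in zip(answers, expected)
        if fact.strip().lower() in answer.lower()
    )
    return hits / len(expected)


# Hypothetical model outputs after training, paired with the injected facts.
answers = [
    "The user's dog is named Rex.",
    "I don't have that information.",
    "She lives in Lisbon.",
]
expected = ["Rex", "Toronto", "Lisbon"]

rate = recall_rate(answers, expected)
print(f"factual recall: {rate:.0%}")
```

Substring matching cannot tell a confident wrong answer from a refusal, so measuring confabulation as described in the abstract would need a separate check.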

Cite this study

Vladimir Baranov (Sun,) studied this question.

www.synapsesocial.com/papers/69a287350a974eb0d3c02bb2 — DOI: https://doi.org/10.5281/zenodo.18778761
