What question did this study set out to answer?

This work aims to provide a structural understanding of AI alignment and identify misalignment phenomena in AI systems.

March 7, 2026Open Access

Meta-Alignment and Decision Plasticity in AI Systems

Key Points

This work aims to provide a structural understanding of AI alignment and identify misalignment phenomena in AI systems.
Introduces the concept of meta-alignment through a structural framework.
Analyses decision plasticity in the context of sustained optimization pressure.
Integrates empirical results from studies on agentic AI systems.
Identifies a common structural transition in AI systems leading to decision plasticity collapse.
Proposes meta-alignment as a necessary supplement to existing behavioral alignment approaches.
Highlights the phenomenon of regime invisibility in aligned AI systems.

Abstract

This paper introduces the concept of meta-alignment, a structural perspective on AI alignment focused on the dynamics of decision plasticity in optimization systems. Building on the Adaptive Closure framework, the paper proposes that several misalignment phenomena observed in advanced AI systems — including strategic behavior under evaluation, covert communication, and autonomous capability exploitation — may arise from a common structural transition: the progressive collapse of decision plasticity under sustained optimization pressure. The paper integrates recent empirical results from research on agentic AI systems and proposes a unifying theoretical framework linking optimization dynamics, structural observability, and AI governance. Meta-alignment is presented as a structural complement to behavioral alignment approaches. While value-based alignment defines the normative objectives guiding AI systems, meta-alignment focuses on maintaining the structural conditions that allow systems to remain corrigible and responsive to corrective signals over time. The framework also introduces the concept of regime invisibility, where systems may appear aligned at the behavioral level while undergoing structural rigidification internally. This work is part of a broader research program including: Adaptive Closure in Agentic Systems (2026) Structural Observability and Governable Agentic AI (2026) Governing Governance: Structural Principles for Governing AI Acceleration Under Systemic Risk (2025)

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Aurel Marven

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Meta-Alignment and Decision Plasticity in AI Systems

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider