What question did this study set out to answer?

This research aims to enhance the understanding of computational divergence in agentic AI systems by integrating multiple representations.

May 21, 2026Open Access

Connecting Activation Geometry to Execution Intent: A Multi-Representation Framework for Detecting Computational Divergence in Agentic AI

Key Points

This research aims to enhance the understanding of computational divergence in agentic AI systems by integrating multiple representations.
Developed a four-layer measurement framework connecting language input, reasoning, activation state, and execution representation.
Formalized a six-case divergence taxonomy for analyzing inter-layer agreement and disagreement.
Outlined a research agenda to empirically apply the framework using kinematic strain verbalization.
Silent divergence was identified as undetectable if only semantic or activation approaches are employed.
The execution layer is crucial for accurate measurement of computational intent.
The framework provides a roadmap for future research in agentic AI systems.

Abstract

The chain-of-thought scratchpad is the primary surface through which model reasoning is observed and governed in agentic AI systems, but it is an incomplete and potentially unfaithful representation of underlying computation. This paper proposes a four-layer measurement framework that treats the scratchpad as one representation among several rather than as ground truth, connecting natural language input, chain-of-thought reasoning, NLA-verbalized activation state (Fraser-Taliente et al., 2026), and kinematic execution representation as independent witnesses to the same computational event. We formalize a six-case divergence taxonomy characterizing what each pattern of inter-layer agreement and disagreement implies about the nature and location of computational misalignment. A key finding is that silent divergence — where expressed reasoning is faithful to the activation state but both misrepresent execution intent — is undetectable by any approach operating exclusively at the semantic or activation level, establishing the execution layer as a structurally irreplaceable measurement component. We describe a research agenda for empirically instantiating the framework, beginning with NLA verbalization at kinematic strain breach points as a near-term contribution achievable on consumer-accessible compute using publicly available infrastructure.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Cook et al. (Tue,) studied this question.

synapsesocial.com/papers/6a0ea17cbe05d6e3efb6027d https://doi.org/https://doi.org/10.5281/zenodo.20280607

Bookmark

View Full Paper