As organizations deploy AI agents to execute tasks in business processes, no systematic academic framework exists for recording, verifying, and ensuring accountability for their actions. This study surveys industrial challenges and regulatory trends in AI agent action recording, formalizes a threat model (T1–T8), and constructs a seven-dimension evaluation framework (D1–D7). Through systematic survey of 16 academic papers and 5 industry projects and comparison matrix analysis, it identifies research gaps and demonstrates the non-substitutability of architectural differences against the two closest existing works (PROV-AGENT, InALign). The survey confirms that "a cryptographically verifiable, independent third-party recording standard for AI agent runtime behavior across platforms" is a unique research area not addressed by any existing work. This paper proposes AEGIS (Agent Execution Governance and Integrity Standard), a three-layer reference architecture targeting these requirements, and evaluates it through a zero-dependency reference implementation with 202 tests across metrics M1–M6 and scenarios S1–S5. Reference implementation: https://github.com/crabsatellite/aegis-protocol
Building similarity graph...
Analyzing shared references across papers
Loading...
Alex Li (Wed,) studied this question.
www.synapsesocial.com/papers/69b3ac3f02a1e69014ccdba7 — DOI: https://doi.org/10.5281/zenodo.18955102
Alex Li
Building similarity graph...
Analyzing shared references across papers
Loading...