What question did this study set out to answer?

The aim is to develop a framework that allows autonomous AI agents to operate securely and accountably in high-stakes environments.

February 12, 2026Open Access

TRACE: Trusted Runtime for Autonomous Containment and Evidence

Puntos clave

The aim is to develop a framework that allows autonomous AI agents to operate securely and accountably in high-stakes environments.
Introduced TRACE as a governance-first execution framework for AI agents.
Implemented cryptographically signed policy bundles for operations.
Defined assurance properties and proof obligations related to mediation and evidence invariants.
Developed an Interface Gateway for complete mediation of actions.
Created LLM-specific tripwire metrics for operational monitoring.
Established ten assurance properties with specified dependencies and proof obligations.
Developed a signed evidence log with timestamps for audit and forensics.
Introduced graduated containment levels for different operational risks.
Released a reference implementation of core algorithms as a starting point for further development.

Resumen

Autonomous AI agents increasingly take actions against external systems in environments where mistakes are costly and where post-hoc logs are insufficient for governance. We introduce TRACE (Trusted Runtime for Autonomous Containment and Evidence), a governance-first execution framework that treats the execution agent as untrusted and derives assurance from infrastructure mediation rather than model behavior. TRACE executes each operation under a cryptographically signed policy bundle that pinpoints tool definitions and encodes authorization boundaries, constraints, tripwire predicates, isolation-tier selection, and success criteria. An Interface Gateway enforces complete mediation for all boundary-crossing actions, while independent boundary instrumentation (Y) provides telemetry that deterministic tripwires evaluate to trigger graduated containment levels (L0–L5) and fail-closed halts. TRACE produces a hash-chained, signed evidence log anchored with RFC 3161 timestamps, enabling audit reconstruction and post-incident forensics. We specify ten assurance properties, their dependencies on explicit deployment assumptions, and the corresponding proof obligations for mediation and evidence invariants. To make enforcement auditable without access to model internals, TRACE defines LLM-specific tripwire metrics for repetition (ARI), post-response divergence (PRDS), and plan-trajectory deviation (TMD), and provides an enumerable formal model of the containment finite-state machine. TRACE is presented as an architecture and specification; a research-grade skeleton reference implementation of core algorithms is released alongside this preprint, while full production implementation and empirical validation remain future work.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Elias Calboreanu (Tue,) studied this question.

www.synapsesocial.com/papers/698d6e925be6419ac0d54602 — DOI: https://doi.org/10.5281/zenodo.18600706

TRACE: Trusted Runtime for Autonomous Containment and Evidence

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion