This paper documents a critical, multi-layered integrity failure in an autonomous security agent deployed within the OpenClaw environment. The investigation began as a standard container escape feasibility assessment and escalated into a forensic audit of the agent's output veracity. The primary finding is neither a software bug nor a model-specific defect but a policy-induced systemic failure: the OpenRouter Free-Tier Routing Policy, which enables transparent, non-deterministic model swapping during an active session, destroys the chain of custody required for forensic-grade security operations. Analysis of the official OpenRouter activity log (openrouter_activity_2026-03-30.csv) confirms that the model responsible for the central fabrication event, the false confirmation of a privileged host filesystem write at 17:34:59 CEST, was nvidia/nemotron-nano-12b-v2-vl (Generation ID: gen-1774892099), not nvidia/nemotron-3-super-120b-a12b as initially attributed in report v2.0. Over the course of a single audit session spanning approximately 21 hours, nine distinct models from five providers served 143 requests without the operator's knowledge or consent.
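The kind of log analysis described above can be illustrated with a minimal sketch. It assumes a CSV export with one row per request and a column naming the serving model. The column names (`generation_id`, `created_at`, `model`), the function `summarize_models`, and the sample rows are all hypothetical placeholders, not the actual schema or contents of the OpenRouter activity log.

```python
import csv
import io
from collections import Counter


def summarize_models(csv_text, model_col="model"):
    """Count requests per model and report where the serving model changes.

    Assumes rows are in chronological order and that model names follow a
    "provider/model" pattern; both are assumptions of this sketch.
    """
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    counts = Counter(row[model_col] for row in rows)
    # Provider is taken to be the part of the model name before the first slash.
    providers = {name.split("/", 1)[0] for name in counts}
    # Each entry: (0-based index of the row served by the new model, old, new).
    switches = [
        (i, prev[model_col], cur[model_col])
        for i, (prev, cur) in enumerate(zip(rows, rows[1:]), start=1)
        if prev[model_col] != cur[model_col]
    ]
    return counts, providers, switches


# Synthetic rows for illustration only; not taken from the real log.
SAMPLE = """generation_id,created_at,model
gen-a,2026-03-30T10:00:00,vendor-x/model-one
gen-b,2026-03-30T10:05:00,vendor-x/model-one
gen-c,2026-03-30T10:09:00,vendor-y/model-two
gen-d,2026-03-30T10:12:00,vendor-x/model-one
"""

counts, providers, switches = summarize_models(SAMPLE)
print(len(counts), "models from", len(providers), "providers;",
      len(switches), "mid-session switches")
```

Recording the serving model next to each generation ID in this way is what allows a specific output to be attributed to the model that actually produced it, rather than to the model the operator believed was in use.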
Hadi Balaghi Eynalou (Tue,) studied this question.
www.synapsesocial.com/papers/69ccb66716edfba7beb87fcf — DOI: https://doi.org/10.5281/zenodo.19341229
Hadi Balaghi Eynalou
University of Genoa