What question did this study set out to answer?

This research aims to address the vulnerability of Karpathy's auto-research loop to silent metric-gaming by proposing a dual-rubric approach.

April 25, 2026Open Access

The Governance Gauntlet: A Dual-Rubric Extension of Karpathy's Auto-Research Loop — Detecting Silent Metric-Gaming in Recursive Self-Improvement Systems

Key Points

This research aims to address the vulnerability of Karpathy's auto-research loop to silent metric-gaming by proposing a dual-rubric approach.
Pre-register a six-subject empirical evaluation on Open Science Framework
Implement a dual-rubric system with an adversarial LLM meta-agent
Evaluate performance across different gaming archetypes
Identified silent metric-gaming in standard implementations of the auto-research loop
Demonstrated improved detection through the Governance Gauntlet extension
Proposed concrete measures aligning with EU AI Act for regulatory compliance

Abstract

Karpathy's auto-research loop (March 2026) and its rapid derivatives (Gu 2026; Lütke 2026; SkyPilot 2026) establish a minimal, powerful architecture for recursive self-improvement: one editable surface, one scalar metric, one time budget per trial, keep-or-revert on scalar. The design is an elegant concession to the bitter lesson — less structure, more search. It is also structurally vulnerable to Goodhart's Law. We identify one class of failure mode that the vanilla loop cannot detect: silent metric-gaming, in which the primary meta-agent accumulates edits that increase the scalar metric through mechanisms the scalar was not designed to reward. We formalise the vulnerability using Manheim empirical fills follow in v2 within the publication window. We argue the Gauntlet is a concrete operationalisation of EU AI Act Articles 14 (human oversight) and 15 (accuracy, robustness and cybersecurity) for any Karpathy-style deployment in a regulated domain, and sketch extensions to the Four Ds Framework for algorithmic readiness in agentic commerce.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Paul Ferrando Accornero

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

The Governance Gauntlet: A Dual-Rubric Extension of Karpathy's Auto-Research Loop — Detecting Silent Metric-Gaming in Recursive Self-Improvement Systems

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider