What question did this study set out to answer?

To formalize the AI In The Loop (AITL) paradigm, proposing a taxonomy for closed-loop autonomous systems.

April 15, 2026Open Access

AI in the Loop (AITL): A Systems Taxonomy for Closed-Loop Autonomous Evaluation

Key Points

To formalize the AI In The Loop (AITL) paradigm, proposing a taxonomy for closed-loop autonomous systems.
Analysis of four systems: AlphaZero, Constitutional AI, SWE-agent, autoresearch.
Validation through a controlled experiment using the Autonomous Empirical Optimization System (AEOS).
Autonomous LLM agents built ML pipelines on a semantically-stripped dataset.
Human supervision shifts from iterative ML engineering to boundary supervision.
Demonstrated autonomous agent stopping behavior in experiments.
Identified failure modes, including the novel Sunk-Cost Continuation failure mode.

Abstract

We identify and formalize AI In The Loop (AITL), a paradigm where AI systems autonomously generate, evaluate, and improve with human intervention restricted to boundary supervision rather than operational decision-making. AITL extends the RLAIF principle—replacing human feedback with AI feedback—from training to the full AI system lifecycle. This is a framework paper with proof-of-concept validation; we propose a unifying taxonomy rather than a benchmark study. Through analysis of four systems (AlphaZero, Constitutional AI, SWE-agent, autoresearch), we extract common properties and propose a unifying taxonomy: self-generation, self-evaluation, self-improvement, and human observation. We validate AITL through a controlled experiment using the Autonomous Empirical Optimization System (AEOS), a model-agnostic ML sandbox where two LLM agents autonomously built ML pipelines on a semantically-stripped dataset. In our experiment, we observe that the dominant human role in AITL shifts from iterative ML engineering (O(n) per iteration) to boundary supervision (O(1) per experiment). Our contributions are: (1) formalization of AITL as a unifying framework for closed-loop autonomous systems, (2) a taxonomy connecting existing systems under shared properties, (3) empirical validation via AEOS demonstrating autonomous agent stopping behavior, and (4) identification of failure modes including a novel Sunk-Cost Continuation failure mode (F6), where agents continue low-yield exploration despite prolonged stagnation. We position AITL as a natural evolution of AI evaluation, suggesting scalable directions infeasible under HITL constraints.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Sanskar jajoo

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

AI in the Loop (AITL): A Systems Taxonomy for Closed-Loop Autonomous Evaluation

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider