What question did this study set out to answer?

The central aim is to present an architecture for effectively governing the use of Large Language Model APIs in enterprise settings.

February 14, 2026Open Access

Toward Deterministic Governance of Large Language Model APIs

Key Points

The central aim is to present an architecture for effectively governing the use of Large Language Model APIs in enterprise settings.
Introduced IAGA control-plane architecture for API governance
Implemented cost containment and reliability mechanisms
Detailed request lifecycle and caching architecture
Conducted performance evaluation under simulated enterprise workloads
Established deterministic cost enforcement and budget guarantees
Achieved circuit-breaker-based reliability for system integrity
Enabled synchronous response validation enhancing response accuracy
Provided cryptographically scoped isolation for multi-tenancy security

Abstract

This preprint presents IAGA, a control-plane architecture for governing Large Language Model (LLM) API usage in enterprise environments. The system introduces deterministic cost enforcement, bounded budget guarantees, circuit-breaker-based reliability, synchronous response validation, and cryptographically scoped multi-tenant isolation. IAGA operates as an OpenAI-compatible gateway, requiring zero application-level changes while enforcing governance at request time. The paper details the full request lifecycle, cost containment model, caching architecture, reliability mechanisms, and controlled performance evaluation under simulated enterprise workloads.

Toward Deterministic Governance of Large Language Model APIs

Key Points

Abstract

Cite This Study