Key points are not available for this paper at this time.
The most expensive mistake in Agentic-AI SDLC is believing the agent understands. It doesn't understand — it guesses. This publication systematizes two years of field experience integrating AI-Agents and Agentic-AI as automation tools into real software delivery pipelines: what worked, what failed structurally, and why the failures were not random but predictable given how stochastic processing works. It examines the structural failure modes of Agentic-AI in software development and proposes a practical framework for working within those limits. The diagnostic section identifies three compounding problems: semantic and structural drift across iterative agent cycles; the absence of distributed verification (multiple agents from the same model produce consistent hallucination, not independent review); and the destructive modification effect, where the agent restructures code outside the requested scope because it has no concept of a local change. The solution framework consists of three interconnected mechanisms. First, a four-level information funnel (semantic → functional → admissibility → technical implementation), where lower levels are formally constrained by higher ones and violations are detectable deterministically. Second, a blueprint library — concrete, versioned reference implementations with explicit adaptation boundaries — as the primary artifact class for agent instructions, distinct from design patterns and generic good code. Third, an MCP code parser: a deterministic, grammar-aware static analysis tool exposed to the agent as an MCP server, producing a compact structural project map (map.json) with module signatures, dependency graph, and admissibility tags. Grammar parsers already exist for virtually every mainstream language (TypeScript Compiler API, Roslyn, tree-sitter, libcst); wrapping one into an MCP server following the template in §7 is a standard DevOps engineering task, not a research problem. The practical section provides 14 operational rules, including: honest zone classification (green/yellow/red) before the project starts; an ROI formula that accounts for oversight hours and regression cost, with a note that semantic debt adds 20–50% to regression costs over a project's lifetime; sandbox requirements for red-zone agent execution (no outbound network, read-only filesystem, least-privilege credentials); and a vendor liability analysis showing that current indemnification clauses from OpenAI, Anthropic, and Google explicitly exclude functional errors and regulatory non-compliance, leaving that liability with the deploying team. The appendix maps the framework to EADM (E.L.I.A. Architectural Design Methodology), where several of these mechanisms are already implemented as language-level constructs, including a grammar-aware MCP parser and a normative AI-assisted authoring protocol (Annex A) as a field-tested instance of the §7 pattern.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chudinov Yurii (Fri,) studied this question.
www.synapsesocial.com/papers/6a095c5d7880e6d24efe264e — DOI: https://doi.org/10.5281/zenodo.20213067
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:
Chudinov Yurii
Building similarity graph...
Analyzing shared references across papers
Loading...