Key points are not available for this paper at this time.
The most expensive mistake in Agentic-AI SDLC is believing the agent understands. It doesn't understand — it guesses. This publication systematizes two years of field experience integrating AI-Agents and Agentic-AI as automation tools into real software delivery pipelines: what worked, what failed structurally, and why the failures were not random but predictable given how stochastic processing works. It examines the structural failure modes of Agentic-AI in software development and proposes a practical framework for working within those limits. The diagnostic section identifies three compounding problems: semantic and structural drift across iterative agent cycles; the absence of distributed verification (multiple agents from the same model produce consistent hallucination, not independent review); and the destructive modification effect, where the agent restructures code outside the requested scope because it has no concept of a local change. The solution framework consists of three interconnected mechanisms. First, a four-level information funnel (semantic → functional → admissibility → technical implementation), where lower levels are formally constrained by higher ones and violations are detectable deterministically. Second, a blueprint library — concrete, versioned reference implementations with explicit adaptation boundaries — as the primary artifact class for agent instructions, distinct from design patterns and generic good code. Third, an MCP code parser: a deterministic, grammar-aware static analysis tool exposed to the agent as an MCP server, producing a compact structural project map (map.json) with module signatures, dependency graph, and admissibility tags. Grammar parsers already exist for virtually every mainstream language (TypeScript Compiler API, Roslyn, tree-sitter, libcst); wrapping one into an MCP server following the template in §7 is a standard DevOps engineering task, not a research problem. The practical section provides 14 operational rules, including: honest zone classification (green/yellow/red) before the project starts; an ROI formula that accounts for oversight hours and regression cost, with a note that semantic debt adds 20–50% to regression costs over a project's lifetime; sandbox requirements for red-zone agent execution (no outbound network, read-only filesystem, least-privilege credentials); and a vendor liability analysis showing that current indemnification clauses from OpenAI, Anthropic, and Google explicitly exclude functional errors and regulatory non-compliance, leaving that liability with the deploying team. The appendix maps the framework to EADM (E.L.I.A. Architectural Design Methodology), where several of these mechanisms are already implemented as language-level constructs, including a grammar-aware MCP parser and a normative AI-assisted authoring protocol (Annex A) as a field-tested instance of the §7 pattern.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chudinov Yurii
Building similarity graph...
Analyzing shared references across papers
Loading...
Chudinov Yurii (Fri,) studied this question.
www.synapsesocial.com/papers/6a095c5d7880e6d24efe264e — DOI: https://doi.org/10.5281/zenodo.20213067
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: