What question did this study set out to answer?

To develop a dynamic framework for generating emergency operation schemes (EOS) that adapt to urban rail transit train-door failures.

March 25, 2026Open Access

Emergency Operation Scheme Generation for Urban Rail Transit Train Door Systems Using Retrieval-Augmented Large Language Models

Key Points

To develop a dynamic framework for generating emergency operation schemes (EOS) that adapt to urban rail transit train-door failures.
Proposed a retrieval-augmented large language model framework for EOS generation.
Normalizing multi-source incident evidence into a structured representation.
Utilized a hybrid retriever combining dense and BM25 retrieval methods for selecting regulatory clauses.
Fine-tuned a generator with structured objectives to ensure compliance and grounding.
Achieved Recall@5 of 0.78 and Coverage@B of 0.71 in retrieval quality.
Demonstrated improved operational usability with SchemaPass of 0.88 and UsableAns of 0.83.
Traditional methods showed significantly lower usability, with pure LLM baseline at 0.15.

Abstract

Urban rail transit (URT) train-door failures are safety-critical and can cause cascading service disruptions, yet existing emergency operation schemes (EOSs) are often static, difficult to adapt to evolving fault patterns, and hard to verify against updated regulations. This study proposes a retrieval-augmented large language model (LLM) framework for executable and evidence-traceable EOS generation. Multi-source heterogeneous incident evidence (structured work orders, operational impact records, and unstructured maintenance/dispatch narratives) is normalized into a structured incident representation, and a hybrid retriever (dense + BM25) with cross-encoder reranking selects compact regulatory clauses and historical cases under a fixed context budget. The generator is fine-tuned with structured objectives to enforce schema compliance, role assignment, and citation grounding. Experiments on 776 passenger-door incidents from Shanghai URT (2019–2024) show that Hybrid + rerank achieves the best retrieval quality (Recall@5 = 0.78; Coverage@B = 0.71; FirstHit/B = 0.46). For generation, the full setting improves operational usability, reaching SchemaPass = 0.88, RoleAcc = 0.91, CiteCov = 0.73, and UsableAns = 0.83, compared with 0.15 UsableAns for a pure LLM baseline and 0.26 for prompting with RAG only. These results indicate that combining high-utility retrieval with structure- and citation-aware fine-tuning substantially improves the executability and verifiability of safety-critical operation schemes.

Emergency Operation Scheme Generation for Urban Rail Transit Train Door Systems Using Retrieval-Augmented Large Language Models

Key Points

Abstract

Cite This Study