What question did this study set out to answer?

This study aims to develop a fast and interpretable method for predicting leakage in buried pipelines using machine learning techniques.

May 31, 2026 | Open Access

Leakage Concentration Prediction and Interpretable Analysis of Buried Pipelines Based on Multi-Layer Perceptron and Interval Sampling

Key Points

Integrated machine learning with interval sampling to predict leakage concentration from CFD-generated pipeline data.
Extracted 140 million samples from 1.4 billion CFD data points using 1:10 interval sampling.
Trained a Multi-Layer Perceptron along with XGBoost and LightGBM models using 17 physical features.
The MLP model achieved R² = 0.9988 and RMSE = 0.0153, significantly outperforming the tree-based models (R² ≈ 0.93).
Robustness was confirmed through three independent sampling runs, with an R² coefficient of variation of approximately 0%.
SHAP analysis identified spatial coordinates and leak aperture as the most influential factors and also revealed a nonlinear influence of soil particle size.
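
The 1:10 interval sampling mentioned in the key points can be illustrated with a minimal sketch. This summary does not describe the exact procedure, so the code assumes plain systematic sampling (keep every 10th record). The function name `interval_sample`, the `offset` parameter, and the idea that independent runs differ only in their starting offset are illustrative assumptions, not the authors' implementation.

```python
from itertools import islice


def interval_sample(records, step=10, offset=0):
    """Keep every `step`-th record, starting at index `offset`.

    Hypothetical helper: a plain systematic-sampling sketch, not the
    paper's code. Works on any iterable, so the full dataset never has
    to be held in memory at once.
    """
    if step < 1:
        raise ValueError("step must be >= 1")
    if not 0 <= offset < step:
        raise ValueError("offset must satisfy 0 <= offset < step")
    return islice(records, offset, None, step)


# Toy stand-in for CFD rows: 1,000 integers instead of 1.4 billion points.
rows = range(1000)

# Two sampling runs that differ only in their starting offset (assumption).
run_a = list(interval_sample(rows, step=10, offset=0))
run_b = list(interval_sample(rows, step=10, offset=3))

print(len(run_a), run_a[:3])  # 100 [0, 10, 20]
print(len(run_b), run_b[:3])  # 100 [3, 13, 23]
```

At a 1:10 ratio the sample is one tenth of the input, which matches the reported reduction from 1.4 billion data points to 140 million samples. Runs with different offsets share no records, which is one simple way to obtain independent subsets for a robustness check.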

Abstract

Buried-pipeline leakage poses significant safety risks, yet traditional CFD (Computational Fluid Dynamics) simulations are too slow for real-time diagnosis. This study integrates machine learning with interval sampling to develop a fast and interpretable prediction method. From 1.4 billion CFD-generated data points, 140 million representative samples were extracted via 1:10 interval sampling. Using 17 physical features as inputs, we trained and compared XGBoost, LightGBM, and a Multi-Layer Perceptron (MLP). The MLP model performed best (R², the coefficient of determination, = 0.9988; RMSE, the root mean square error, = 0.0153), significantly outperforming the tree-based models (R² ≈ 0.93). Three independent sampling runs confirmed its robustness (R² coefficient of variation ≈ 0%). SHAP (Shapley Additive Explanations) analysis identified spatial coordinates and leak aperture as the most critical factors, while also revealing the nonlinear influence of soil particle size. This approach offers a high-precision, interpretable, and efficient surrogate model for buried-pipeline leakage warning systems.
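
For readers unfamiliar with the reported metrics, the following sketch computes R², RMSE, and the coefficient of variation from their standard definitions. All numbers are toy values chosen for easy arithmetic and are not results from the paper. The function names are illustrative, and the use of the sample standard deviation (n - 1 in the denominator) for the coefficient of variation is an assumption.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between targets and predictions."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean (assumed definition)."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / (len(values) - 1)
    return math.sqrt(variance) / mean


# Toy targets and predictions (not data from the study).
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.1, 1.9, 3.2, 3.8]

print(round(r2_score(y_true, y_pred), 4))  # 0.98
print(round(rmse(y_true, y_pred), 4))      # 0.1581

# Toy R² values standing in for three sampling runs (not from the study).
toy_r2_runs = [0.97, 0.98, 0.99]
print(round(coefficient_of_variation(toy_r2_runs) * 100, 2))  # 1.02 (percent)
```

A coefficient of variation near 0%, as reported for the MLP, means the R² values from the separate sampling runs were almost identical, so the model's accuracy did not depend on which subset was drawn.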
