What question did this study set out to answer?

This research aims to compare various machine learning and deep learning models for predicting groundwater levels in karst aquifers.

April 16, 2026Open Access

Benchmarking Machine Learning and Deep Learning Models for Groundwater Level Prediction in Karst Aquifers: The Dominant Role of Hydrogeological Complexity

Puntos clave

This research aims to compare various machine learning and deep learning models for predicting groundwater levels in karst aquifers.
Evaluated nine machine learning and deep learning models (e.g., RF, LSTM, Transformer)
Used two years of hourly data from three distinct hydrogeological zones
Applied a multidimensional evaluation framework focusing on accuracy, stability, and efficiency
Transformer achieved the highest single-step prediction accuracy (R2 = 0.813–0.965, RMSE = 0.130–0.606 m)
CNN-LSTM balanced predictive performance and cost with an average training time of 27.97 s
N-BEATS excelled in long-term stability with R2 = 0.914 in 12-step forecasting at ZK1
Hydrogeological complexity significantly influenced prediction capability, exceeding model architecture effects.

Resumen

Karst aquifers present unique challenges for groundwater level prediction due to their dual-porosity structures and highly nonlinear hydrological responses. This study systematically evaluates nine machine learning and deep learning models (RF, XGBoost, LSTM, CNN, Transformer, N-BEATS, CNN-LSTM, Seq2Seq-LSTM, and Attention-Seq2Seq-LSTM) for rainfall-driven groundwater level forecasting in the Maocun subterranean river catchment, Guilin, Guangxi, China. Two years of hourly high-frequency data from three monitoring sites representing distinct hydrogeological zones (recharge, flow, and discharge) were employed within a multidimensional evaluation framework integrating single-step accuracy, multi-step stability, and computational efficiency. Results indicate that the Transformer achieved the highest single-step prediction accuracy, attaining the lowest RMSE (0.130–0.606 m) and highest R2 (0.813–0.965) across all three sites. CNN-LSTM offered the best balance between predictive performance and computational cost, requiring an average training time of only 27.97 s and 28.0 convergence epochs. N-BEATS demonstrated superior long-term stability in 12-steps-ahead forecasting, achieving R2 = 0.914 at ZK1, outperforming all other architectures. More fundamentally, hydrogeological complexity exerted a dominant control on predictive skill that systematically outweighed differences arising from model architecture. All models yielded R2 below 0.813 at the geologically complex ZK2 site, whereas R2 exceeded 0.950 across all models at ZK1, indicating that aquifer complexity, rather than algorithm selection, constitutes the primary constraint on prediction feasibility. This study presents the first application of N-BEATS to karst groundwater level forecasting and proposes a replicable multidimensional evaluation framework, providing a scientific reference for intelligent modelling of complex karst systems.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Zhu et al. (Tue,) studied this question.

synapsesocial.com/papers/69e07d8f2f7e8953b7cbe8d3 https://doi.org/https://doi.org/10.3390/w18080939

Me gusta

Guardar

Ver artículo completo