What question did this study set out to answer?

This study aims to optimize input variable combinations for predicting groundwater levels using machine learning.

May 9, 2026Open Access

Groundwater Level Prediction with Optimized Input Variable Combinations Using GS-LSTM and TOPSIS

Key Points

This study aims to optimize input variable combinations for predicting groundwater levels using machine learning.
Developed a hybrid framework integrating GS-LSTM and TOPSIS for input combination evaluation.
Assessed 30 input combinations derived from precipitation, air temperature, relative humidity, wind speed, and evapotranspiration across 22 monitoring wells.
Compared prediction performance using daily versus monthly averaged data across varied hyperparameter settings.
Optimal input combinations show substantial variability among wells, indicating spatial heterogeneity.
Increased prediction success rate by over 40% and improved R2 by greater than 0.3 when using monthly averaged data.
Identified a minimal set of eight input combinations that maintain high accuracy while reducing computational cost by 92.1% compared to all combinations.

Abstract

Groundwater level prediction is essential for sustainable water resource management. Although machine learning models are widely applied, input variable selection critically affects predictive performance, and existing studies rarely evaluate model performance comprehensively, considering accuracy, stability, physical interpretability, and computational efficiency. To address this issue, this study develops a hybrid framework integrating grid search-optimized long short-term memory (GS-LSTM) with the technique for order preference by similarity to ideal solution (TOPSIS). Using the Houston area as a case study, the framework evaluates 30 input combinations derived from precipitation (P), air temperature (T), relative humidity (H), wind speed (W), and reference evapotranspiration (E) across 22 monitoring wells to identify optimal and minimal input variable combinations sets. Key findings include: (1) optimal input combinations vary substantially among wells, highlighting spatial heterogeneity; (2) P and E are dominant drivers; (3) compared to daily input data, monthly averaged data increases the prediction success rate (proportion of successful runs across 27 hyperparameter configurations) by >40% and improves R2 by >0.3; (4) the minimal set comprises eight representative combinations that collectively cover the top-three ranked variable combinations for all 22 wells, maintaining high accuracy (e.g., Well 12# daily data: MAE = 0.13 m, RMSE = 0.16 m, R2 = 0.92) while reducing computational cost by 92.1% relative to testing all 30 combinations. The proposed optimal and minimal input sets offer a stable, accurate, and computationally efficient solution for groundwater resource management that accounts for spatial heterogeneity.

Groundwater Level Prediction with Optimized Input Variable Combinations Using GS-LSTM and TOPSIS

Key Points

Abstract

Cite This Study