Sustainable groundwater management in hyper-arid regions requires accurate water quality assessments, yet remote desert environments present major challenges due to data scarcity, high sampling costs, and limited laboratory infrastructure. This study proposes a framework integrating the Water Quality Index (WQI) with Inverse Distance Weighting (IDW)-based spatial data augmentation and machine learning classification for groundwater quality assessment in the Tabelbala region, southwestern Algeria. Three classifiers were evaluated, Random Forest (RF), Support Vector Machines (SVMs), and Artificial Neural Networks (ANNs), and trained on an augmented dataset generated from 178 original groundwater samples using IDW interpolation with a sensitivity-optimized 150 m radius, producing 2779 augmented training points. RF achieved the highest predictive accuracy (85.9%), followed by ANNs (84.7%) and SVMs (83.1%), with all models demonstrating excellent discriminative performances (area under the receiver operating characteristic curve > 0.96). Permutation Feature Importance analysis identified total dissolved solids (TDS), sulfates (SO42−), total hardness (TH), and chlorides (Cl−) as the most influential parameters, consistent with World Health Organization (WHO) guidelines. Spatial distribution maps revealed that the majority of groundwater sources exhibited poor to very poor quality, highlighting the urgent need for local water management interventions. The proposed framework offers a replicable decision-support tool for water resource managers in data-scarce arid environments, supporting SDG 6 (Clean Water and Sanitation) and SDG 13 (Climate Action).
Farhi et al. (Thu,) studied this question.