Soil heavy metal pollution caused by mineral resource extraction activities poses a serious threat to the ecological environment within and surrounding mining areas. As a highly concealed toxic heavy metal, arsenic (As) urgently requires the establishment of efficient pollution monitoring methods to achieve pollution prevention and control, as well as environmental remediation in mining areas. This study investigated the feasibility of hyperspectral remote sensing inversion for soil heavy metal arsenic based on ZY-1 02E hyperspectral satellite imagery, focusing on a mining area and its surrounding soils in Sanlisha’an, Wuxuan County, Guangxi. Full Constrained Least Squares (FCLS) was employed to separate mixed pixels and enhance soil spectral contributions in ZY-1 02E imagery, thereby mitigating vegetation interference. Six mathematical transformations, including RT, AT, FD, RTFD, ATFD, and SD, were applied to both the original and enhanced spectra to enhance spectral features. The correlations between the transformed spectra, as well as the original image spectra (S), and soil As concentration were analyzed; then the spectra strongly correlated with soil As concentration were selected to construct Ratio Spectral Index (RSI) and Normalized Difference Spectral Index (NDSI). Correlation matrices were calculated between RSI/NDSI indices and As concentration. Sensitive features were screened using an improved Successive Projection Algorithm (SPA). As concentration inversion was also performed with four models: traditional regression models, PLSR and MLR, and ensemble learning models (RF and XGBoost). In the soil contribution-enhanced spectral modeling results, the optimal transformation–index combination is ATFD-NDSI. The performance indicators of each model are as follows: MLR test set R2 = 0.65, PLSR test set R2 = 0.62, RF test set R2 = 0.7, and XGBoost test set R2 = 0.64. The results indicate that the ATFD-NDSI-RF ensemble model provides the best performance. By integrating multiple decision trees, RF effectively handles complex nonlinear relationships, thus enhancing the accuracy and generalization ability of predication. The analysis of NDSI–ATFD–RF inversion results based on sampling points indicates that model error correlates with the pollution intensity gradient, showing greater errors, especially in high-concentration areas, but still maintaining strong correlations (tailings reservoir: r = 0.92, forested areas: r = 0.96, and cropland: r = 0.83). The spatial distribution reveals that the inversion results are closely similar to the spatial distribution of IDW interpolation. Areas with high As concentrations are concentrated in the tailings reservoir and in the southeastern part of the study area. The correlation coefficient between the inversion results and IDW interpolation is 0.6, which further verifies that the inversion results effectively reproduce the spatial distribution trend of highly polluted areas.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yuqin Li
System Simulation (United Kingdom)
Dan Meng
System Simulation (United Kingdom)
Qi Yang
System Simulation (United Kingdom)
Remote Sensing
Capital Normal University
System Simulation (United Kingdom)
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Fri,) studied this question.
synapsesocial.com/papers/69ada8c2bc08abd80d5bbf7d — DOI: https://doi.org/10.3390/rs18050822