High-accuracy spatiotemporal monitoring of surface nitrogen dioxide (NO2) concentrations is essential for air quality management. This study evaluates machine learning-based estimates of near-surface NO2 concentrations using data from the geostationary GEMS instrument and the polar-orbiting TROPOMI over China in 2022. Four tree-based models—Random Forest, XGBoost, CatBoost, and LightGBM—were trained by integrating satellite vertical-column densities with multi-source meteorological and ancillary data. Results show that CatBoost achieved the highest accuracy, with an R2 of 0.842 for GEMS and 0.765 for TROPOMI, alongside the lowest RMSE and MAE. Models trained on GEMS data consistently outperformed TROPOMI-based models across all metrics. This advantage is primarily attributed to the substantially larger training sample size enabled by GEMS’s high temporal resolution, as confirmed through a controlled experiment with consistent sample sizes which isolated the effect of data volume. Spatially, GEMS estimates captured sharper concentration gradients and localized emission hotspots, while TROPOMI produced smoother fields. Temporally, only GEMS allowed the reconstruction of detailed diurnal patterns and near-real-time pollution episode tracking. This study confirms the significant added value of geostationary satellite data for high-frequency air quality monitoring and analysis when combined with machine learning.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yijin Ma
Yi Wang
Jun Wang
Building similarity graph...
Analyzing shared references across papers
Loading...
Ma et al. (Sun,) studied this question.
www.synapsesocial.com/papers/6994058c4e9c9e835dfd685b — DOI: https://doi.org/10.3390/rs18040614