Accurate mapping of near-surface air temperature (Tair) at the fine spatial resolution is required for city-scale monitoring and remains a critical challenge in Earth Observation (EO). Reliance on ground-based measurements is constrained by their sparse spatial coverage and high operational costs. We present a novel, scalable machine learning framework designed to overcome this limitation. Our method utilizes interpretable Convolutional Neural Networks (CNNs) to fuse high-resolution Landsat data, integrating both thermal and reflective spectral bands, with contextual spatiotemporal metadata. This approach allows for inference, at 30 m resolution, of Tair fields without relying on dense, localized ground monitoring networks. Our hybrid CNN architecture is optimized for spatial generalization, maintaining strong and transferable performance (station-wise R2≈0.88) across diverse environments from humid coasts (R2≈0.89) to arid interiors (R2≈0.84). Although focused on a specific geographical region, our results suggest a robust and reproducible pathway for generating spatially consistent temperature fields from globally available EO archives, directly supporting urban heat island mitigation, climate policy development, and high-resolution public health assessment worldwide.
Eitan et al. (Sun,) studied this question.