With accelerating global urbanization and increasingly diverse demands for public spaces, promoting urban low-carbon transitions and enhancing residents’ quality of life have become central missions of modern urban development. As one of the city’s primary arteries, streets—through their green landscapes, slow-moving transportation systems, and public facilities—play an indispensable role in reducing carbon emissions, promoting healthy living, and improving residents’ well-being. In this study, the Yubei District of Chongqing was selected as the research area, and an automated evaluation framework was proposed for street visual quality, based on multi-source street view data and ensemble learning. PSP-Net semantic segmentation model was employed to extract eight key visual indicators from street view images, including green view index, Visual Entropy (Entropy), sky view factor (SVF), drivable space, sidewalk, safety facilities, buildings, and enclosure. Based on these features, a Stacking-based ensemble learning model was constructed, integrating multiple base models such as Random Forest, XGBoost, and LightGBM, with Linear Regression as the meta-learner, to predict street visual quality. The results demonstrate that the ensemble model significantly outperforms any single model, achieving a correlation coefficient (r) of 0.77 and effectively capturing the complex perceptual features of street environments. This study provides a reliable, intelligent, and quantitative method for large-scale evaluation of urban street visual quality, while supplying data support and decision-making references for street renewal and spatial optimization.
Wang et al. (Mon,) studied this question.