The quality and uncertainty of training samples are a critical bottleneck constraining the performance of machine learning in landslide susceptibility mapping (LSM). To address this, this study introduces a sample purity optimization framework based on state-process consistency (SPC). The framework utilizes two indices, SWCFD and CF-BE-HMD, combined with a mutually exclusive stratified purity strategy (MESPS), to systematically optimize the purity of the training samples. The framework was validated using data from Zhenxiong County, Yunnan Province, China. The results indicate that sample purity is the core variable influencing LSM model performance. The mutually exclusive sample group with the highest purity achieved AUC values of 0.92–0.93, markedly higher than those of other groups. High-purity landslide and non-landslide samples effectively suppressed the risk generalization effect caused by low-purity positive samples and overcame the risk conservatism problem induced by low-purity negative samples, resulting in clearer boundaries in the risk zonation. Therefore, only through bidirectional purification, which ensures that both positive and negative samples are highly representative, can these two extreme biases be maximally eliminated, leading to the generation of scientific, reliable, and balanced landslide susceptibility maps. This study offers a feasible solution to the sample quality bottleneck, providing a solid theoretical and empirical basis for shifting landslide modeling towards a quality-driven approach.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhu et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69ba43cb4e9516ffd37a55c2 — DOI: https://doi.org/10.1007/s10064-026-04863-w
Jiaying Zhu
Yuqian Yang
Shuangyun Peng
Bulletin of Engineering Geology and the Environment
Yunnan Normal University
Southwest Forestry University
State Forestry and Grassland Administration
Building similarity graph...
Analyzing shared references across papers
Loading...