Accurate prediction of ultra-high-performance concrete (UHPC) compressive strength is essential for optimizing mixture design and reducing experimental iterations. Existing machine learning approaches suffer from limited algorithm diversity, insufficient statistical validation, and inadequate uncertainty quantification. This study presents a comprehensive framework through systematic evaluation of 20 algorithms across seven categories on 863 experimental observations. Six physically meaningful composite features (such as water-cement ratio, total binder content, and fiber aspect ratio) are engineered to capture intrinsic material relationships, with the Boruta algorithm employed for feature selection. Statistical robustness is ensured through 30 repeated experiments analyzed using both frequentist (p-value, effect size, 95% CI) and Bayesian approaches. CatBoost achieves optimal performance (R2 = 0.8979 ± 0.0239, RMSE = 10.58 ± 1.45 MPa), with curing age, sand content, and steel fiber volume identified as dominant predictors through multi-perspective interpretability analysis integrating SHAP, ALE, permutation importance, and LIME. External validation on 810 independent samples yields R2 = 0.5923 (RMSE = 25.68 MPa) under significant cross-dataset conditions, with performance reduction attributed to feature availability differences and distribution shift. Comprehensive uncertainty quantification yields prediction uncertainty of 3.48%, substantially below previously reported thresholds. The proposed framework offers practitioners a reliable tool for UHPC mixture screening while maintaining prediction confidence for structural engineering applications.
Huang et al. (Thu,) studied this question.