March 3, 2026Open Access

Explainable Machine Learning Models SHAP-based for Feature Importance Affecting Stunting Prevalence

Key Points

Stunting prevalence classification accuracy achieved 90.00% using the random forest model, representing a significant finding.
SHAP (Shapley Additive Explanations) provides clarity on feature importance, revealing underweight as a key predictor.
Analysis employing machine learning models like decision trees and support vector machines enhances predictive power.
These insights underline the necessity for interpretability in machine learning to inform public health policies.

Abstract

Stunting is a form of chronic nutritional deficiency in toddlers and remains a major public health concern due to its impact on child growth and development. Efforts to reduce its prevalence continue to be strengthened in Indonesia, particularly in Sumatra Province. This study aims to evaluate the accuracy of a logistic regression model and three machine learning models—decision tree, random forest, and support vector machine (SVM)—in classifying stunting prevalence. The response variable is defined as the prevalence of stunting among toddlers, categorized into two classes: exceeding the national target and not exceeding the national target, based on the 2024 national threshold. Although classification models can provide accurate predictions, they often lack interpretability. Therefore, this study applies the SHAP method to the best-performing machine learning model to identify the key factors influencing stunting. The use of Shapley values is justified through the uniqueness theorem, which establishes it as the only attribution method satisfying desirable fairness properties. SHAP values are employed to explain the model by referencing both the trained model and the underlying data. The results show that the random forest model achieves the highest accuracy (90.00%), outperforming the other models. SHAP analysis reveals that Underweight is the most influential predictor contributing to stunting prevalence in Sumatra Province. These findings highlight the relevance of machine learning interpretability in supporting policy decisions for stunting reduction.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Asysta Pasaribu

Nur Fitriyani Sahamony

KHAIRIL ANWAR NOTODIPUTRO

Journals

SHILAP Revista de lepidopterología

Actions

Institutions

IPB University

Binus University

Universitas Bina Darma

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Explainable Machine Learning Models SHAP-based for Feature Importance Affecting Stunting Prevalence

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study