What question did this study set out to answer?

The research aims to evaluate machine learning models for predicting sepsis-related outcomes with a focus on trust and explainability.

March 18, 2026 | Open Access

Responsible AI for Sepsis Prediction: Bridging the Gap Between Machine Learning Performance and Clinical Trust

Key Points

The study evaluates machine learning models for predicting sepsis-related outcomes, with a focus on trust and explainability.
Several machine learning architectures were evaluated on the MIMIC-IV database.
The models predicted hospital mortality, length of stay, and septic shock onset.
SHAP was applied to assess model interpretability.
XGBoost achieved an AUROC of 0.874 for hospital mortality prediction, outperforming the other models.
Variable-importance analysis confirmed the clinical relevance of the predictive variables.

Abstract

Background: Sepsis remains a leading cause of mortality in intensive care units (ICUs) worldwide. Machine learning models for clinical prediction must be accurate, fair, transparent, and reliable if physicians are to rely on them with confidence in their decision-making.

Methods: We used the MIMIC-IV (version 3.1) database to evaluate several machine learning architectures, including Logistic Regression, XGBoost, LightGBM, Long Short-Term Memory (LSTM) networks, and Transformer models. We predicted three main clinical targets (hospital mortality, length of stay, and septic shock onset) in accordance with responsible AI principles. Model interpretability was assessed using Shapley Additive Explanations (SHAP).

Results: The XGBoost model demonstrated superior performance across the prediction tasks, particularly for hospital mortality (AUROC 0.874), outperforming LSTM networks, Transformers, and linear baselines. The variable-importance analysis confirmed the clinical relevance of the model.

Conclusions: While XGBoost and ensemble algorithms demonstrate superior predictive power for sepsis prognosis, their clinical adoption requires robust explainability mechanisms to earn the trust of physicians.
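For readers less familiar with the headline metric, AUROC can be read as the probability that a randomly chosen positive case (for example, a patient who died in hospital) receives a higher risk score than a randomly chosen negative case. The following is a minimal sketch of that definition in plain Python. The function name `auroc` and the toy labels and scores are illustrative assumptions, not the study's code or data.

```python
def auroc(labels, scores):
    """Probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative label")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (1 = died in hospital, 0 = survived); not from MIMIC-IV.
toy_labels = [1, 0, 1, 0, 0, 1]
toy_scores = [0.9, 0.2, 0.6, 0.7, 0.1, 0.8]
toy_auc = auroc(toy_labels, toy_scores)  # 8 of 9 positive/negative pairs are ranked correctly
```

The pairwise loop is quadratic in the number of samples, so it is suitable only for illustration; on a cohort-sized dataset a rank-based implementation from an established library would be the practical choice.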
