Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

This study aimed to develop and validate an interpretable machine-learning model that utilizes clinical features and inflammatory biomarkers to predict the risk of in-hospital mortality in critically ill patients suffering from sepsis. We enrolled all patients diagnosed with sepsis in the Medical In...

Full description

Saved in:

Bibliographic Details
Published in:	European journal of medical research Vol. 29; no. 1; p. 156
Main Authors:	Zhang, Guyu, Shao, Fei, Yuan, Wei, Wu, Junyuan, Qi, Xuan, Gao, Jie, Shao, Rui, Tang, Ziren, Wang, Tao
Format:	Journal Article
Language:	English
Published:	England BioMed Central Ltd 06-03-2024 BioMed Central BMC
Subjects:	Algorithms Biological markers Biomarkers Blood pressure Cholesterol High density lipoprotein HIV Hospitals Human immunodeficiency virus Infection Infections Inflammation Intensive care unit Lipoproteins Lymphocytes Machine learning Machining learning Medical advice systems Medical centers Medical prognosis Metastasis Mortality Neutrophils Parameter estimation Pathophysiology Patient outcomes Performance evaluation Prediction Regression analysis Rheumatic diseases Risk factors Sepsis Structured Query Language-SQL Tumors Variables XGBoost Sepsis Machining learning XGBoost Intensive care unit Prediction
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	This study aimed to develop and validate an interpretable machine-learning model that utilizes clinical features and inflammatory biomarkers to predict the risk of in-hospital mortality in critically ill patients suffering from sepsis. We enrolled all patients diagnosed with sepsis in the Medical Information Mart for Intensive Care IV (MIMIC-IV, v.2.0), eICU Collaborative Research Care (eICU-CRD 2.0), and the Amsterdam University Medical Centers databases (AmsterdamUMCdb 1.0.2). LASSO regression was employed for feature selection. Seven machine-learning methods were applied to develop prognostic models. The optimal model was chosen based on its accuracy, F1 score and area under curve (AUC) in the validation cohort. Moreover, we utilized the SHapley Additive exPlanations (SHAP) method to elucidate the effects of the features attributed to the model and analyze how individual features affect the model's output. Finally, Spearman correlation analysis examined the associations among continuous predictor variables. Restricted cubic splines (RCS) explored potential non-linear relationships between continuous risk factors and in-hospital mortality. 3535 patients with sepsis were eligible for participation in this study. The median age of the participants was 66 years (IQR, 55-77 years), and 56% were male. After selection, 12 of the 45 clinical parameters collected on the first day after ICU admission remained associated with prognosis and were used to develop machine-learning models. Among seven constructed models, the eXtreme Gradient Boosting (XGBoost) model achieved the best performance, with an AUC of 0.94 and an F1 score of 0.937 in the validation cohort. Feature importance analysis revealed that Age, AST, invasive ventilation treatment, and serum urea nitrogen (BUN) were the top four features of the XGBoost model with the most significant impact. Inflammatory biomarkers may have prognostic value. Furthermore, SHAP force analysis illustrated how the constructed model visualized the prediction of the model. This study demonstrated the potential of machine-learning approaches for early prediction of outcomes in patients with sepsis. The SHAP method could improve the interoperability of machine-learning models and help clinicians better understand the reasoning behind the outcome.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	2047-783X 0949-2321 2047-783X
DOI:	10.1186/s40001-024-01756-0