Explainable Boosting Machine in Sepsis Prediction Using Platelet Metabolomics: An Interpretable Machine Learning Approach
Diagnostics, cilt.16, sa.11, 2026 (SCI-Expanded, Scopus)
- Yayın Türü: Makale / Tam Makale
- Cilt numarası: 16 Sayı: 11
- Basım Tarihi: 2026
- Doi Numarası: 10.3390/diagnostics16111643
- Dergi Adı: Diagnostics
- Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, EMBASE, Directory of Open Access Journals, Academic Search Ultimate (EBSCO), Biomedical Reference Collection: Corporate Edition (EBSCO)
- Anahtar Kelimeler: biomarkers, clinical decision support, explainable boosting machine, machine learning, metabolomics, platelet metabolism, sepsis
- İnönü Üniversitesi Adresli: Evet
Özet
Background: Sepsis remains a leading cause of mortality in emergency and intensive care settings, with early diagnosis representing a critical determinant of patient outcomes. Despite advances in biomarker discovery, integrating platelet-derived metabolic signatures with explainable machine learning frameworks for sepsis prediction remains underexplored. The clinical adoption of predictive models has been hindered by the “black box” nature of conventional algorithms, limiting clinician trust and understanding. Objective: This study aimed to evaluate and validate an interpretable machine learning model utilizing platelet metabolomics data for accurate sepsis prediction while providing clinically meaningful explanations of the underlying metabolic disturbances that could inform therapeutic decision-making. Methods: We analyzed metabolomics data, comprising 25 sepsis patients diagnosed according to Sepsis-3 criteria and 14 age- and gender-matched non-sepsis from the emergency department. Platelet metabolite profiles were obtained via quantitative 1H-NMR spectroscopy. Five machine learning algorithms were evaluated: Explainable Boosting Machine (EBM), Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosting Machine (GBM), and AdaBoost. Three biologically motivated metabolite ratios (adenosine triphosphate/adenosine diphosphate (ATP/ADP), ATP/adenosine monophosphate (AMP), Glutamine/Glutamate) were derived as additional features, yielding 22 candidate variables. Models were evaluated using a fully nested leave-one-out cross-validation (LOOCV) framework in which log transformation, KNN imputation, BorderlineSMOTE class balancing, and hyperparameter optimisation were performed exclusively within each training fold. Global and local interpretability analyses were performed to identify discriminative metabolites. Results: EBM achieved the highest ROC-AUC (0.864; 95% CI: 0.736–1.000), the highest PR-AUC (0.902; 95% CI: 0.783–0.997), and the best Brier score (0.189; 95% CI: 0.130–0.258) among all evaluated models, with sensitivity 0.880 (95% CI: 0.640–1.000; TP = 22/25) and specificity 0.714 (95% CI: 0.357–1.000; TN = 10/14). Global feature importance identified Carnitine, myo-Inositol, ADP, and O-Phosphoethanolamine as the leading single-feature predictors, alongside three pairwise interaction terms reflecting non-additive energy–amino acid metabolic relationships. Local explanations demonstrated that the ADP–Creatine interaction, Glutamine, and myo-Inositol drove correct sepsis classification in a representative true positive case. Conclusions: The EBM model demonstrated the highest discriminative performance and best calibration among all evaluated models, providing transparent mechanistic insights through global feature importance, and patient-level local explanations. These findings position the proposed framework as a proof-of-concept warranting external validation in larger, multi-centre cohorts before any clinical application is considered.