Proteomic alterations in ovarian cancer—Predicting residual disease status using artificial intelligence and SHAP-based biomarker interpretation


YAŞAR Ş., MELEKOĞLU R.

Frontiers in Medicine, cilt.12, 2025 (SCI-Expanded, Scopus) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 12
  • Basım Tarihi: 2025
  • Doi Numarası: 10.3389/fmed.2025.1562558
  • Dergi Adı: Frontiers in Medicine
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, EMBASE, Directory of Open Access Journals
  • Anahtar Kelimeler: high-grade serous ovarian cancer (HGSOC), machine learning, neoadjuvant chemotherapy (NACT), proteomic biomarkers, SHAP analysis
  • İnönü Üniversitesi Adresli: Evet

Özet

Introduction: High-grade serous ovarian cancer (HGSOC) is the most aggressive and prevalent subtype of ovarian Treatment outcomes are significantly influenced by residual disease status following neoadjuvant chemotherapy (NACT). Predicting residual disease before surgery can improve patient stratification and personalized treatment strategies. Methods: This study analyzed pre-NACT proteomic data from 20 HGSOC patients treated with NACT. Patients were categorized into two groups based on surgical outcomes: no residual disease (R0, n = 14) and suboptimal residual disease (R1, n = 6). From an initial set of 97 differentially expressed proteins, 18 significant proteins were selected using the BORUTA feature selection method. Three machine learning models-Random Forest (RF), Support Vector Machine (SVM), and Bootstrap Aggregation with Classification and Regression Trees (BaggedCART)-were developed and evaluated. Results: The Random Forest model achieved the best performance with an AUC of 0.955, accuracy of 0.830, sensitivity of 0.904, specificity of 0.763, and F1-score of 0.839. SHapley Additive exPlanations (SHAP) analysis identified five proteins (P48637, O43491, O95302, Q96CX2, and P49189) as the most influential predictors of residual disease. These proteins, including glutathione synthetase and peptidyl-prolyl cis-trans isomerase FKBP9, are associated with chemotherapy resistance mechanisms. Discussion: The findings demonstrate the potential of integrating proteomic data with machine learning techniques for predicting surgical outcomes in HGSOC. Identified protein signatures may serve as valuable biomarkers for anticipating NACT response and informing clinical decision-making, ultimately contributing to personalized patient care.