Automated Machine Learning and Explainable AI (AutoML-XAI) for Metabolomics: Improving Cancer Diagnostics

被引:4
|
作者
Bifarin, Olatomiwa O. [1 ]
Fernandez, Facundo M. [1 ,2 ]
机构
[1] Georgia Inst Technol, Sch Chem & Biochem, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Petit Inst Bioengn & Biosci, Atlanta, GA 30332 USA
关键词
metabolomics; automated machine learning; explainableAI; cancer biology; Shapley additive explanations;
D O I
10.1021/jasms.3c00403
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Metabolomics generates complex data necessitating advanced computational methods for generating biological insight. While machine learning (ML) is promising, the challenges of selecting the best algorithms and tuning hyperparameters, particularly for nonexperts, remain. Automated machine learning (AutoML) can streamline this process; however, the issue of interpretability could persist. This research introduces a unified pipeline that combines AutoML with explainable AI (XAI) techniques to optimize metabolomics analysis. We tested our approach on two data sets: renal cell carcinoma (RCC) urine metabolomics and ovarian cancer (OC) serum metabolomics. AutoML, using Auto-sklearn, surpassed standalone ML algorithms like SVM and k-Nearest Neighbors in differentiating between RCC and healthy controls, as well as OC patients and those with other gynecological cancers. The effectiveness of Auto-sklearn is highlighted by its AUC scores of 0.97 for RCC and 0.85 for OC, obtained from the unseen test sets. Importantly, on most of the metrics considered, Auto-sklearn demonstrated a better classification performance, leveraging a mix of algorithms and ensemble techniques. Shapley Additive Explanations (SHAP) provided a global ranking of feature importance, identifying dibutylamine and ganglioside GM(d34:1) as the top discriminative metabolites for RCC and OC, respectively. Waterfall plots offered local explanations by illustrating the influence of each metabolite on individual predictions. Dependence plots spotlighted metabolite interactions, such as the connection between hippuric acid and one of its derivatives in RCC, and between GM3(d34:1) and GM3(18:1_16:0) in OC, hinting at potential mechanistic relationships. Through decision plots, a detailed error analysis was conducted, contrasting feature importance for correctly versus incorrectly classified samples. In essence, our pipeline emphasizes the importance of harmonizing AutoML and XAI, facilitating both simplified ML application and improved interpretability in metabolomics data science.
引用
收藏
页码:1089 / 1100
页数:12
相关论文
共 50 条
  • [21] Explainable AI: Machine Learning Interpretation in Blackcurrant Powders
    Przybyl, Krzysztof
    SENSORS, 2024, 24 (10)
  • [22] Structure Learning and Hyperparameter Optimization Using an Automated Machine Learning (AutoML) Pipeline
    Filippou, Konstantinos
    Aifantis, George
    Papakostas, George A.
    Tsekouras, George E.
    INFORMATION, 2023, 14 (04)
  • [23] Explainable AI: A Review of Machine Learning Interpretability Methods
    Linardatos, Pantelis
    Papastefanopoulos, Vasilis
    Kotsiantis, Sotiris
    ENTROPY, 2021, 23 (01) : 1 - 45
  • [24] Machine Learning for Fab Automated Diagnostics
    Giollo, Manuel
    Lam, Auguste
    Gkorou, Dimitra
    Liu, Xing Lan
    van Haren, Richard
    33RD EUROPEAN MASK AND LITHOGRAPHY CONFERENCE, 2017, 10446
  • [25] Enhancing Thyroid Cancer Diagnostics Through Hybrid Machine Learning and Metabolomics Approaches
    Raj, Meghana G.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (02) : 282 - 291
  • [26] Comparison of automated machine learning (AutoML) libraries in time series forecasting
    Akkurt, Nagihan
    Hasgui, Servet
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2024, 39 (03): : 1693 - 1701
  • [27] Automated Machine Learning (AutoML) for All Business Majors and Levels Workshop
    Larsen, Kai R.
    AMCIS 2018 PROCEEDINGS, 2018,
  • [28] Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools
    Anh Truong
    Walters, Austin
    Goodsitt, Jeremy
    Hines, Keegan
    Bruss, C. Bayan
    Farivar, Reza
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1471 - 1479
  • [29] Hierarchical automated machine learning (AutoML) for advanced unconventional reservoir characterization
    Yousef Mubarak
    Ardiansyah Koeshidayatullah
    Scientific Reports, 13 (1)
  • [30] Explainable Artificial Intelligence (XAI) in glaucoma assessment: Advancing the frontiers of machine learning algorithms
    Nimmy, Sonia Farhana
    Hussain, Omar K.
    Chakrabortty, Ripon K.
    Saha, Sajib
    KNOWLEDGE-BASED SYSTEMS, 2025, 316