An interpretable stacking ensemble learning framework based on multi-dimensional data for real-time prediction of drug concentration: The example of olanzapine

被引:11
|
作者
Zhu, Xiuqing [1 ,2 ]
Hu, Jinqing [1 ,2 ]
Xiao, Tao [1 ,3 ]
Huang, Shanqing [1 ,2 ]
Wen, Yuguan [1 ,2 ]
Shang, Dewei [1 ,2 ]
机构
[1] Guangzhou Med Univ, Affiliated Brain Hosp, Dept Pharm, Guangzhou, Peoples R China
[2] Guangdong Engn Technol Res Ctr Translat Med Mental, Guangzhou, Peoples R China
[3] Guangdong Second Prov Gen Hosp, Dept Clin Res, Guangzhou, Peoples R China
关键词
olanzapine; drug concentration; therapeutic drug monitoring; stacking; machine learning; electronic health record; interpretability; model-informed precision dosing; IMPACT; CHINA;
D O I
10.3389/fphar.2022.975855
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Background and Aim: Therapeutic drug monitoring (TDM) has evolved over the years as an important tool for personalized medicine. Nevertheless, some limitations are associated with traditional TDM. Emerging data-driven model forecasting [e.g., through machine learning (ML)-based approaches] has been used for individualized therapy. This study proposes an interpretable stacking-based ML framework to predict concentrations in real time after olanzapine (OLZ) treatment.Methods: The TDM-OLZ dataset, consisting of 2,142 OLZ measurements and 472 features, was formed by collecting electronic health records during the TDM of 927 patients who had received OLZ treatment. We compared the performance of ML algorithms by using 10-fold cross-validation and the mean absolute error (MAE). The optimal subset of features was analyzed by a random forest-based sequential forward feature selection method in the context of the top five heterogeneous regressors as base models to develop a stacked ensemble regressor, which was then optimized via the grid search method. Its predictions were explained by using local interpretable model-agnostic explanations (LIME) and partial dependence plots (PDPs).Results: A state-of-the-art stacking ensemble learning framework that integrates optimized extra trees, XGBoost, random forest, bagging, and gradient-boosting regressors was developed for nine selected features [i.e., daily dose (OLZ), gender_male, age, valproic acid_yes, ALT, K, BW, MONO#, and time of blood sampling after first administration]. It outperformed other base regressors that were considered, with an MAE of 0.064, R-square value of 0.5355, mean squared error of 0.0089, mean relative error of 13%, and ideal rate (the percentages of predicted TDM within & PLUSMN; 30% of actual TDM) of 63.40%. Predictions at the individual level were illustrated by LIME plots, whereas the global interpretation of associations between features and outcomes was illustrated by PDPs.Conclusion: This study highlights the feasibility of the real-time estimation of drug concentrations by using stacking-based ML strategies without losing interpretability, thus facilitating model-informed precision dosing.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] MULTI-PARAMETER PREDICTION FOR STEAM TURBINE BASED ON REAL-TIME DATA USING DEEP LEARNING APPROACHES
    Sun, Lei
    Liu, Tianyuan
    Xie, Yonghui
    Xia, Xinlei
    PROCEEDINGS OF ASME TURBO EXPO 2021: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, VOL 8, 2021,
  • [32] A Deep Learning Approach Based on Feature Reconstruction and Multi-dimensional Attention Mechanism for Drug-Drug Interaction Prediction
    Xie, Jiang
    Ouyang, Jiaming
    Zhao, Chang
    He, Hongjian
    Dong, Xin
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 400 - 410
  • [33] Multi-Source-Data-Oriented Ensemble Learning Based PM 2.5 Concentration Prediction in Shenyang
    Qi, Tianfang
    Jiang, Hongxun
    Shi, Xiaowen
    PROCEEDINGS OF THE 52ND ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2019, : 1284 - 1293
  • [34] Deep learning-based framework for real-time transient stability prediction under stealthy data integrity attacks
    Kesici, Mert
    Mohammadpourfard, Mostafa
    Aygul, Kemal
    Genc, Istemihan
    ELECTRIC POWER SYSTEMS RESEARCH, 2023, 221
  • [35] Real-time P300-based BCI in mechatronic control by using a multi-dimensional approach
    De Venuto, Daniela
    Annese, Valerio F.
    Mezzina, Giovanni
    IET SOFTWARE, 2018, 12 (05) : 418 - 424
  • [36] A spark-based big data analysis framework for real-time sentiment prediction on streaming data
    Kilinc, Deniz
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (09): : 1352 - 1364
  • [37] AN ONLINE LOAD BALANCING SCHEDULING ALGORITHM FOR CLOUD DATA CENTERS CONSIDERING REAL-TIME MULTI-DIMENSIONAL RESOURCE
    Xu, Minxian
    Tian, Wenhong
    2012 IEEE 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENT SYSTEMS (CCIS) VOLS 1-3, 2012, : 264 - 268
  • [38] Hypertuning-Based Ensemble Machine Learning Approach for Real-Time Water Quality Monitoring and Prediction
    Bin Shahid, Md. Shamim
    Rifat, Habibur Rahman
    Uddin, Md Ashraf
    Islam, Md Manowarul
    Mahmud, Md. Zulfiker
    Sakib, Md Kowsar Hossain
    Roy, Arun
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [39] Real-time milk analysis integrated with stacking ensemble learning as a tool for the daily prediction of cheese-making traits in Holstein cattle
    Mota, Lucio F. M.
    Giannuzzi, Diana
    Bisutti, Vittoria
    Pegolo, Sara
    Trevisi, Erminio
    Schiavon, Stefano
    Gallo, Luigi
    Fineboym, David
    Katz, Gil
    Cecchinato, Alessio
    JOURNAL OF DAIRY SCIENCE, 2022, 105 (05) : 4237 - 4255
  • [40] Majority Vote of Ensemble Machine Learning Methods for Real-Time Epilepsy Prediction Applied on EEG Pediatric Data
    Jukic, Samed
    Keco, Dino
    Kevric, Jasmin
    TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2018, 7 (02): : 313 - 318