Novel Machine Learning Identifies 5 Asthma Phenotypes Using Cluster Analysis of Real-World Data

被引:1
|
作者
Wu, Chao-Ping [1 ]
Sleiman, Joelle [2 ]
Fakhry, Battoul [2 ]
Chedraoui, Celine [2 ]
Attaway, Amy [1 ,2 ]
Bhattacharyya, Anirban [3 ]
Bleecker, Eugene R. [4 ]
Erdemir, Ahmet [1 ]
Hu, Bo [1 ]
Kethireddy, Shravan [2 ]
Meyers, Deborah A. [3 ,4 ]
Rashidi, Hooman H. [4 ,5 ]
Zein, Joe G. [3 ,4 ]
机构
[1] Cleveland Clin, Resp Inst, Cleveland, OH USA
[2] Cleveland Clin, Lerner Res Inst, Cleveland, OH USA
[3] Mayo Clin, Dept Med, Jacksonville, FL USA
[4] Mayo Clin, Dept Med, Div Pulm Med, Scottsdale, AZ 85259 USA
[5] Cleveland Clin, Pathol & Lab Med Inst, Cleveland, OH USA
基金
美国国家卫生研究院;
关键词
Asthma; Machine learning; Asthma phenotypes; Cluster analysis; EXPRESSION; SEVERITY; NETWORK; COHORT;
D O I
10.1016/j.jaip.2024.04.035
中图分类号
R392 [医学免疫学];
学科分类号
100102 ;
摘要
BACKGROUND: Asthma classification fi cation into different subphenotypes is important to guide personalized therapy and improve outcomes. OBJECTIVES: To further explore asthma heterogeneity through determination of multiple patient groups by using novel machine learning (ML) approaches and large-scale real-world data. METHODS: We used electronic health records of patients with asthma followed at the Cleveland Clinic between 2010 and 2021. We used k-prototype unsupervised ML to develop a clustering model where predictors were age, sex, race, body mass index, prebronchodilator and postbronchodilator spirometry measurements, and the usage of inhaled/systemic steroids. We applied elbow and silhouette plots to select the optimal number of clusters. These clusters were then evaluated through LightGBM's ' s supervised ML approach on their cross-validated F1 score to support their distinctiveness. RESULTS: Data from 13,498 patients with asthma with available postbronchodilator spirometry measurements were extracted to identify 5 stable clusters. Cluster 1 included a young nonsevere asthma population with normal lung function and higher frequency of acute exacerbation (0.8 /patient-year). Cluster 2 had the highest body mass index (mean +/- SD, 44.44 +/- 7.83 kg/m(2)), and the highest proportion of females (77.5%) and Blacks (28.9%). Cluster 3 comprised patients with normal lung function. Cluster 4 included patients with lower percent of predicted FEV1 of 77.03 (12.79) and poor response to bronchodilators. Cluster 5 had the lowest percent of predicted FEV1 of 68.08 (15.02), the highest postbronchodilator reversibility, and the highest proportion of severe asthma (44.9%) and blood eosinophilia (> 300 cells/mu L) (34.8%). CONCLUSIONS: Using real-world data and unsupervised ML, we classified asthma into 5 clinically important subphenotypes where group-specific fi c asthma treatment and management strategies can be designed and deployed. (c) 2024 American Academy of Allergy, Asthma & Immunology
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Novel Machine Learning Identifies Five Asthma Phenotypes Using Cluster Analysis of Real-world Data
    Wu, C.
    Sleiman, J.
    Attaway, A.
    Bleecker, E. R.
    Chedraoui, C.
    Battoul, F.
    Meyers, D. A.
    Zein, J. G.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2023, 207
  • [2] Predicting real-world response to mepolizumab in severe asthma using machine learning
    Usuba, Koyo
    Zhang, Lingjiao
    Liu, Xinyang
    Han, Tim
    Nightingale, Natalie
    Tehrani, Ali
    Zhang, Shiyuan
    Howarth, Peter
    Alfonso-Cristancho, Rafael
    EUROPEAN RESPIRATORY JOURNAL, 2024, 64
  • [3] machine learning applications using real-world data: A literature review
    Adair, Nicholas
    Icten, Zeynep
    Friedman, Mark
    Menzin, Joseph
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 339 - 339
  • [4] Towards Machine Learning with Zero Real-World Data
    Kang, Cholmin
    Jung, Hyunwoo
    Lee, Youngki
    WEARSYS'19: PROCEEDINGS OF THE 5TH ACM WORKSHOP ON WEARABLE SYSTEMS AND APPLICATIONS, 2019, : 41 - 46
  • [5] Using machine learning on real-world data to predict metastatic status.
    Green, Foad H.
    Huang, Hu T.
    Lerman, Michelle
    Tran, Mary
    Subramanian, Vinod
    Loving, Joshua
    Rioth, Matthew J.
    JOURNAL OF CLINICAL ONCOLOGY, 2022, 40 (16)
  • [6] IMPROVING EFFICIENCY IN ANALYSIS OF REAL-WORLD DATA WITH AN AUTOMATED MACHINE LEARNING TOOL
    Zhang, Y.
    Lo-Ciganic, W. H.
    Xie, H.
    Iyer, R.
    Snyder, D.
    Lineman, P.
    Tian, M. Y.
    VALUE IN HEALTH, 2024, 27 (06) : S271 - S271
  • [7] Predicting Response to Tocilizumab Monotherapy in Rheumatoid Arthritis: A Real-world Data Analysis Using Machine Learning
    Johansson, Fredrik D.
    Collins, Jamie E.
    Yau, Vincent
    Guan, Hongshu
    Kim, Seoyoung C.
    Losina, Elena
    Sontag, David
    Stratton, Jacklyn
    Trinh, Huong
    Greenberg, Jeffrey
    Solomon, Daniel H.
    JOURNAL OF RHEUMATOLOGY, 2021, 48 (09) : 1364 - 1370
  • [8] Cluster analysis identifies novel real-world lung diseasepulmonary hypertension subphenotypes: implications for treatment response
    Johnson, Shelsey W.
    Wang, Rui-Sheng
    Winter, Michael R.
    Gillmeyer, Kari R.
    Zeder, Katarina
    Klings, Elizabeth S.
    Goldstein, Ronald H.
    Wiener, Renda Soylemez
    Maron, Bradley A.
    ERJ OPEN RESEARCH, 2024, 10 (03)
  • [9] Real-World Data and Machine Learning to Predict Cardiac Amyloidosis
    Garcia-Garcia, Elena
    Maria Gonzalez-Romero, Gracia
    Martin-Perez, Encarna M.
    Zapata Cornejo, Enrique de Dios
    Escobar-Aguilar, Gema
    Cardenas Bonnet, Marlon Felix
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (03) : 1 - 15
  • [10] Real-World Evidence: Integrating Machine Learning with Real-World Big Data for Predictive Analytics in Healthcare
    Vecchio, Nicolas
    CARDIOLOGY, 2024,