Novel Machine Learning Identifies 5 Asthma Phenotypes Using Cluster Analysis of Real-World Data

被引:1
|
作者
Wu, Chao-Ping [1 ]
Sleiman, Joelle [2 ]
Fakhry, Battoul [2 ]
Chedraoui, Celine [2 ]
Attaway, Amy [1 ,2 ]
Bhattacharyya, Anirban [3 ]
Bleecker, Eugene R. [4 ]
Erdemir, Ahmet [1 ]
Hu, Bo [1 ]
Kethireddy, Shravan [2 ]
Meyers, Deborah A. [3 ,4 ]
Rashidi, Hooman H. [4 ,5 ]
Zein, Joe G. [3 ,4 ]
机构
[1] Cleveland Clin, Resp Inst, Cleveland, OH USA
[2] Cleveland Clin, Lerner Res Inst, Cleveland, OH USA
[3] Mayo Clin, Dept Med, Jacksonville, FL USA
[4] Mayo Clin, Dept Med, Div Pulm Med, Scottsdale, AZ 85259 USA
[5] Cleveland Clin, Pathol & Lab Med Inst, Cleveland, OH USA
基金
美国国家卫生研究院;
关键词
Asthma; Machine learning; Asthma phenotypes; Cluster analysis; EXPRESSION; SEVERITY; NETWORK; COHORT;
D O I
10.1016/j.jaip.2024.04.035
中图分类号
R392 [医学免疫学];
学科分类号
100102 ;
摘要
BACKGROUND: Asthma classification fi cation into different subphenotypes is important to guide personalized therapy and improve outcomes. OBJECTIVES: To further explore asthma heterogeneity through determination of multiple patient groups by using novel machine learning (ML) approaches and large-scale real-world data. METHODS: We used electronic health records of patients with asthma followed at the Cleveland Clinic between 2010 and 2021. We used k-prototype unsupervised ML to develop a clustering model where predictors were age, sex, race, body mass index, prebronchodilator and postbronchodilator spirometry measurements, and the usage of inhaled/systemic steroids. We applied elbow and silhouette plots to select the optimal number of clusters. These clusters were then evaluated through LightGBM's ' s supervised ML approach on their cross-validated F1 score to support their distinctiveness. RESULTS: Data from 13,498 patients with asthma with available postbronchodilator spirometry measurements were extracted to identify 5 stable clusters. Cluster 1 included a young nonsevere asthma population with normal lung function and higher frequency of acute exacerbation (0.8 /patient-year). Cluster 2 had the highest body mass index (mean +/- SD, 44.44 +/- 7.83 kg/m(2)), and the highest proportion of females (77.5%) and Blacks (28.9%). Cluster 3 comprised patients with normal lung function. Cluster 4 included patients with lower percent of predicted FEV1 of 77.03 (12.79) and poor response to bronchodilators. Cluster 5 had the lowest percent of predicted FEV1 of 68.08 (15.02), the highest postbronchodilator reversibility, and the highest proportion of severe asthma (44.9%) and blood eosinophilia (> 300 cells/mu L) (34.8%). CONCLUSIONS: Using real-world data and unsupervised ML, we classified asthma into 5 clinically important subphenotypes where group-specific fi c asthma treatment and management strategies can be designed and deployed. (c) 2024 American Academy of Allergy, Asthma & Immunology
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Distinguishing Asthma Phenotypes Using Machine Learning Approaches
    Rebecca Howard
    Magnus Rattray
    Mattia Prosperi
    Adnan Custovic
    Current Allergy and Asthma Reports, 2015, 15
  • [32] Mapping of Real-World Data Sources for Asthma in Europe
    Lemieux-Uresandi, Danae
    Bergamasco, Aurore
    Moride, Yola
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 558 - 559
  • [33] Distinguishing Asthma Phenotypes Using Machine Learning Approaches
    Howard, Rebecca
    Rattray, Magnus
    Prosperi, Mattia
    Custovic, Adnan
    CURRENT ALLERGY AND ASTHMA REPORTS, 2015, 15 (07)
  • [34] Developing predictive precision medicine models by exploiting real-world data using machine learning methods
    Theocharopoulos, Panagiotis C.
    Bersimis, Sotiris
    Georgakopoulos, Spiros V.
    Karaminas, Antonis
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    JOURNAL OF APPLIED STATISTICS, 2024, 51 (14) : 2980 - 3003
  • [35] Intelligent diagnosis of Kawasaki disease from real-world data using interpretable machine learning models
    Duan, Yifan
    Wang, Ruiqi
    Huang, Zhilin
    Chen, Haoran
    Tang, Mingkun
    Zhou, Jiayin
    Hu, Zhengyong
    Hu, Wanfei
    Chen, Zhenli
    Qian, Qing
    Wang, Haolin
    HELLENIC JOURNAL OF CARDIOLOGY, 2025, 81 : 38 - 48
  • [36] Replication of Real-World Evidence in Oncology Using Electronic Health Record Data Extracted by Machine Learning
    Benedum, Corey M.
    Sondhi, Arjun
    Fidyk, Erin
    Cohen, Aaron B.
    Nemeth, Sheila
    Adamson, Blythe
    Estevez, Melissa
    Bozkurt, Selen
    CANCERS, 2023, 15 (06)
  • [37] Early diagnosis of persons with von Willebrand disease using a machine learning algorithm and real-world data
    Sidonio Jr, Robert F.
    Lu, Anan
    Hale, Sarah
    Caicedo, Jorge
    Bullano, Mike
    Xing, Shan
    EXPERT REVIEW OF HEMATOLOGY, 2024, 17 (06) : 261 - 268
  • [38] Machine learning in a real-world PFO study: analysis of data from multi-centers in China
    Dongling Luo
    Ziyang Yang
    Gangcheng Zhang
    Qunshan Shen
    Hongwei Zhang
    Junxing Lai
    Hui Hu
    Jianxin He
    Shulin Wu
    Caojin Zhang
    BMC Medical Informatics and Decision Making, 22
  • [39] Machine learning in a real-world PFO study: analysis of data from multi-centers in China
    Luo, Dongling
    Yang, Ziyang
    Zhang, Gangcheng
    Shen, Qunshan
    Zhang, Hongwei
    Lai, Junxing
    Hu, Hui
    He, Jianxin
    Wu, Shulin
    Zhang, Caojin
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [40] State of health analysis of batteries at different stages based on real-world vehicle data and machine learning
    Wang, Jiegang
    Yang, Haixu
    Wang, Zhenpo
    Zhou, Yangjie
    Liu, Peng
    Hong, Jichao
    JOURNAL OF ENERGY STORAGE, 2024, 88