Machine learning enabled subgroup analysis with real-world data to inform clinical trial eligibility criteria design

被引:3
|
作者
Xu, Jie [1 ,2 ]
Zhang, Hao [2 ]
Zhang, Hansi [1 ]
Bian, Jiang [1 ]
Wang, Fei [2 ]
机构
[1] Univ Florida, Dept Hlth Outcomes & Biomed Informat, Gainesville, FL 32610 USA
[2] Weill Cornell Med, Dept Populat Hlth Sci, New York, NY 10065 USA
来源
SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期
关键词
RANDOMIZED CONTROLLED-TRIALS; EXTERNAL VALIDITY; GENERALIZABILITY; RECRUITMENT; VARIABLES; DEMENTIA; ERA;
D O I
10.1038/s41598-023-27856-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Overly restrictive eligibility criteria for clinical trials may limit the generalizability of the trial results to their target real-world patient populations. We developed a novel machine learning approach using large collections of real-world data (RWD) to better inform clinical trial eligibility criteria design. We extracted patients' clinical events from electronic health records (EHRs), which include demographics, diagnoses, and drugs, and assumed certain compositions of these clinical events within an individual's EHRs can determine the subphenotypes-homogeneous clusters of patients, where patients within each subgroup share similar clinical characteristics. We introduced an outcome-guided probabilistic model to identify those subphenotypes, such that the patients within the same subgroup not only share similar clinical characteristics but also at similar risk levels of encountering severe adverse events (SAEs). We evaluated our algorithm on two previously conducted clinical trials with EHRs from the OneFlorida+ Clinical Research Consortium. Our model can clearly identify the patient subgroups who are more likely to suffer or not suffer from SAEs as subphenotypes in a transparent and interpretable way. Our approach identified a set of clinical topics and derived novel patient representations based on them. Each clinical topic represents a certain clinical event composition pattern learned from the patient EHRs. Tested on both trials, patient subgroup (#SAE=0) and patient subgroup (#SAE>0) can be well-separated by k-means clustering using the inferred topics. The inferred topics characterized as likely to align with the patient subgroup (#SAE>0) revealed meaningful combinations of clinical features and can provide data-driven recommendations for refining the exclusion criteria of clinical trials. The proposed supervised topic modeling approach can infer the clinical topics from the subphenotypes with or without SAEs. The potential rules for describing the patient subgroups with SAEs can be further derived to inform the design of clinical trial eligibility criteria.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Machine learning enabled subgroup analysis with real-world data to inform clinical trial eligibility criteria design
    Jie Xu
    Hao Zhang
    Hansi Zhang
    Jiang Bian
    Fei Wang
    Scientific Reports, 13 (1)
  • [2] Mapping Real-World Psoriatic Arthritis Patients to Clinical Trial Eligibility Criteria
    Arendse, Regan
    Haaland, Derek
    Fortin, Isabelle
    Rahman, Proton
    Rampakakis, Emmanouil
    Asin-Milan, Odalis
    Rachich, Meagan
    Nantel, Francois
    Lehman, Allen
    JOURNAL OF RHEUMATOLOGY, 2020, 47 (07) : 1035 - 1036
  • [3] Mapping Real-World Ankylosing Spondylitis Patients to Clinical Trial Eligibility Criteria
    Sholter, Dalton
    Inman, Robert
    Masetto, Ariel
    Rahman, Proton
    Rampakakis, Emmanouil
    Asin-Milan, Odalis
    Rachich, Meagan
    Nantel, Francois
    Lehman, Allen
    JOURNAL OF RHEUMATOLOGY, 2020, 47 (07) : 1101 - 1101
  • [4] Mapping Real-World Rheumatoid Arthritis Patients to Clinical Trial Eligibility Criteria
    Keystone, Edward
    Olszynski, Wojciech
    Bessette, Louis
    Rahman, Proton
    Rampakakis, Emmanouil
    Lehman, Allen
    Asin-Milan, Odalis
    Nantel, Francois
    Rachich, Meagan
    JOURNAL OF RHEUMATOLOGY, 2020, 47 (07) : 1107 - 1108
  • [5] MAPPING REAL-WORLD ANKYLOSINC SPONDYLITIS PATIENTS TO CLINICAL TRIAL ELIGIBILITY CRITERIA
    Sholter, Dalton
    Inman, Robert
    Masetto, Ariel
    Rahman, Proton
    Rampakakis, Emmanouil
    Asin-Milan, Odalis
    Rachich, Meagan
    Nantel, Francois
    Lehman, Allen J.
    JCR-JOURNAL OF CLINICAL RHEUMATOLOGY, 2020, 26 : 26 - 27
  • [6] MAPPING REAL-WORLD RHEUMATOID ARTHRITIS PATIENTS TO CLINICAL TRIAL ELIGIBILITY CRITERIA
    Keystone, Edward
    Olszynski, Wojciech
    Bessette, Louis
    Rahman, Proton
    Rampakakis, Emmanouil
    Lehman, Allen J.
    Asin-Milan, Odalis
    Nantel, Francois
    Rachich, Meagan
    JCR-JOURNAL OF CLINICAL RHEUMATOLOGY, 2020, 26 : 26 - 26
  • [7] MAPPING REAL-WORLD PSORIATIC ARTHRITIS PATIENTS TO CLINICAL TRIAL ELIGIBILITY CRITERIA
    Arendse, Regan
    Haaland, Derek
    Fortin, Isabelle
    Rahman, Proton
    Rampakakis, Emmanouil
    Asin-Milan, Odalis
    Rachich, Meagan
    Nantel, Francois
    Lehman, Allen J.
    JCR-JOURNAL OF CLINICAL RHEUMATOLOGY, 2020, 26 : 25 - 26
  • [8] Digital Trial Solutions eScreening: Software to rank patients by their predicted clinical trial eligibility using real-world data and machine learning
    Bhandari, Sudip
    Levonian, Zachary
    Annand, Robert
    Baxter, Jo-Zahn
    Cain, Jericho
    Flach, Jen
    Joseph-Thomas, Jiby
    Mueller, Judith
    JOURNAL OF CLINICAL ONCOLOGY, 2023, 41 (16)
  • [9] Using real world data to inform clinical trial design
    Houghton, Richard
    Golding, Sophie
    Rizzo, Shemra
    Sanglier, Thibaut
    Schroder, Carsten
    Shah, Jigna
    Spain, C. Victor
    Zhang, Qing
    Furby, Hannah V.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2023, 32 : 582 - 583
  • [10] A STRATEGY FOR INCREASING CLINICAL TRIAL DIVERSITY USING REAL-WORLD DATA TO INFORM PATIENT SELECTION CRITERIA
    Kargbo, I. M.
    Jin, H.
    Winer-Jones, J.
    VALUE IN HEALTH, 2022, 25 (07) : S607 - S607