Dynamic categorization of clinical research eligibility criteria by hierarchical clustering

Cited by: 34
Authors
Luo, Zhihui [1]
Yetisgen-Yildiz, Meliha [2]
Weng, Chunhua [1]
Affiliations
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
[2] Univ Washington, Seattle, WA 98195 USA
Keywords
Clinical research eligibility criteria; Classification; Hierarchical clustering; Knowledge representation; Unified Medical Language System (UMLS); Machine learning; Feature representation
DOI
10.1016/j.jbi.2011.06.001
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Objective: To semi-automatically induce semantic categories of eligibility criteria from text and to automatically classify eligibility criteria based on their semantic similarity.
Design: The UMLS semantic types and a set of previously developed semantic preference rules were utilized to create an unambiguous semantic feature representation to induce eligibility criteria categories through hierarchical clustering and to train supervised classifiers.
Measurements: We induced 27 categories and measured the prevalence of the categories in 27,278 eligibility criteria from 1578 clinical trials and compared the classification performance (i.e., precision, recall, and F1-score) between the UMLS-based feature representation and the "bag of words" feature representation among five common classifiers in Weka, including J48, Bayesian Network, Naive Bayesian, Nearest Neighbor, and instance-based learning classifier.
Results: The UMLS semantic feature representation outperforms the "bag of words" feature representation in 89% of the criteria categories. Using the semantically induced categories, machine-learning classifiers required only 2000 instances to stabilize classification performance. The J48 classifier yielded the best F1-score and the Bayesian Network classifier achieved the best learning efficiency.
Conclusion: The UMLS is an effective knowledge source and can enable an efficient feature representation for semi-automated semantic category induction and automatic categorization for clinical research eligibility criteria and possibly other clinical text.
© 2011 Elsevier Inc. All rights reserved.
Pages: 927-935
Number of pages: 9
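
The Design step in the abstract (semantic-type features fed into hierarchical clustering) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the five example criteria, their hand-assigned UMLS semantic types, the Jaccard similarity, the average-linkage rule, and the 0.4 stopping threshold. The paper's own feature construction relies on semantic preference rules that this record does not describe.

```python
from itertools import combinations

# Hypothetical toy data: each criterion is mapped by hand to a set of
# UMLS semantic type names. A real pipeline would obtain these from a
# UMLS-based tagger; this mapping is an assumption for illustration.
criteria = {
    "history of diabetes mellitus": {"Disease or Syndrome"},
    "diagnosis of heart failure": {"Disease or Syndrome", "Diagnostic Procedure"},
    "currently taking warfarin": {"Pharmacologic Substance"},
    "allergy to penicillin": {"Pharmacologic Substance", "Pathologic Function"},
    "serum creatinine above 2 mg/dL": {"Laboratory Procedure"},
}


def jaccard(a, b):
    """Jaccard similarity between two sets of semantic types."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def average_linkage(c1, c2, features):
    """Mean pairwise similarity between the members of two clusters."""
    sims = [jaccard(features[i], features[j]) for i in c1 for j in c2]
    return sum(sims) / len(sims)


def agglomerate(features, threshold):
    """Bottom-up clustering: repeatedly merge the most similar pair of
    clusters until the best similarity falls below the threshold."""
    clusters = [frozenset([key]) for key in sorted(features)]
    while len(clusters) > 1:
        (a, b), best = max(
            (((x, y), average_linkage(x, y, features))
             for x, y in combinations(clusters, 2)),
            key=lambda pair_and_sim: pair_and_sim[1],
        )
        if best < threshold:
            break
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters


if __name__ == "__main__":
    for cluster in agglomerate(criteria, threshold=0.4):
        print(sorted(cluster))
```

In this toy run the two disease-related criteria share a semantic type and end up in one cluster, as do the two drug-related criteria, while the laboratory criterion stays on its own. Induced clusters of this kind would then be reviewed and labeled by a person, which is the "semi-automatic" part of the approach.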