A data-driven concept schema for defining clinical research data needs

Cited by: 7
Authors
Hruby, Gregory W. [1]
Hoxha, Julia [1]
Ravichandran, Praveen Chandar [1]
Mendonca, Eneida A. [2, 3]
Hanauer, David A. [4, 5]
Weng, Chunhua [1]
Affiliations
[1] Columbia Univ, Dept Biomed Informat, 622 West 168 St,PH-20, New York, NY 10032 USA
[2] Univ Wisconsin, Dept Pediat, Madison, WI USA
[3] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[4] Univ Michigan, Dept Pediat, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Sch Informat, Ann Arbor, MI 48109 USA
Keywords
Medical informatics; Comparative effectiveness research; Needs assessment; Data collection; Models, Theoretical; Framework
DOI
10.1016/j.ijmedinf.2016.03.008
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objectives: The Patient, Intervention, Control/Comparison, and Outcome (PICO) framework is an effective technique for framing a clinical question. We aim to develop the counterpart of PICO to structure clinical research data needs. Methods: We used a data-driven approach to abstract key concepts representing clinical research data needs by adapting and extending an expert-derived framework originally developed for defining cancer research data needs. We annotated clinical trial eligibility criteria, electronic health record (EHR) data request logs, and data queries to EHRs to extract and harmonize concept classes representing clinical research data needs. We evaluated class coverage, class preservation from the original framework, schema generalizability, schema understandability, and schema structural correctness through semi-structured interviews with eight multidisciplinary domain experts. We iteratively refined the schema based on the evaluations. Results: Our data-driven schema preserved 68% of the 63 classes from the original framework and covered 88% (73/82) of the classes proposed by evaluators. Class coverage for participants of different backgrounds ranged from 60% to 100%, with a median value of 95% agreement among the individual evaluators. The schema was found understandable and structurally sound. Conclusions: Our proposed schema may serve as the counterpart to PICO for improving the communication of research data needs between researchers and informaticians. © 2016 Elsevier Ireland Ltd. All rights reserved.
Pages: 1-9
Page count: 9