Data quality assessment framework to assess electronic medical record data for use in research

被引:48
|
作者
Reimer, Andrew P. [1 ,2 ]
Milinovich, Alex [2 ]
Madigan, Elizabeth A. [1 ]
机构
[1] Case Western Reserve Univ, Frances Payne Bolton Sch Nursing, 10900 Euclid Ave, Cleveland, OH 44106 USA
[2] Cleveland Clin, 10900 Euclid Ave, Cleveland, OH 44195 USA
基金
美国国家卫生研究院;
关键词
Electronic medical records; Evaluation & assessment; Information storage; Retrieval & integration;
D O I
10.1016/j.ijmedinf.2016.03.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction: The proliferation and use of electronic medical records (EMR) in the clinical setting now provide a rich source of clinical data that can be leveraged to support research on patient outcomes, comparative effectiveness, and health systems research. Once the large volume and variety of data that robust clinical EMRs provide is aggregated, the suitability of the data for research purposes must be addressed. Therefore, the purpose of this paper is two-fold. First, we present a stepwise framework capable of guiding initial data quality assessment when matching multiple data sources regardless of context or application. Then, we demonstrate a use case of initial analysis of a longitudinal data repository of electronic health record data that illustrates the first four steps of the framework, and report results. Methods: A six-step data quality assessment framework is proposed and described that includes the following data quality assessment steps: (1) preliminary analysis, (2) documentation-longitudinal concordance, (3) breadth, (4) data element presence, (5) density, and (6) prediction. The six-step framework was applied to the Transport Data Mart a data repository that contains over 28,000 records for patients that underwent interhospital transfer that includes EMRs from the sending hospitalization, transport, and receiving hospitalization. Results: There were a total of 9557 log entries of which 8139 were successfully matched to corresponding hospital encounters. 2832 were successfully mapped to both the sending and receiving hospital encounters (resulting in a 93% automatic matching rate), with 590 including air medical transport EMR data representing a complete case for testing. Results from Step 2 indicate that once records are identified and matched, there appears to be relatively limited drop-off of additional records when the criteria for matching increases, indicating the a proportion of records consistently contain nearly complete data. Measures of central tendency used in Step 3 and 4 exhibit a right skewness suggesting that a small proportion of records contain the highest number of repeated measures for the measured variables. Conclusions: The proposed six-step data quality assessment framework is useful in establishing the meta data for a longitudinal data repository that can be replicated by other studies. There are practical issues that need to be addressed including the data quality assessments with the most prescient being the need to establish data quality metrics for benchmarking acceptable levels of EMR data inclusiveness through testing and application. (C) 2016 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:40 / 47
页数:8
相关论文
共 50 条
  • [31] A data quality assessment to inform hypertension surveillance using primary care electronic medical record data from Alberta, Canada
    Stephanie Garies
    Kerry McBrien
    Hude Quan
    Donna Manca
    Neil Drummond
    Tyler Williamson
    BMC Public Health, 21
  • [32] Data quality probes - A synergistic method for quality monitoring of electronic medical record data accuracy and healthcare provision
    Brown, PJB
    Harwood, J
    Brantigan, P
    MEDINFO 2001: PROCEEDINGS OF THE 10TH WORLD CONGRESS ON MEDICAL INFORMATICS, PTS 1 AND 2, 2001, 84 : 1116 - 1119
  • [33] Electronic health record data quality assessment and tools: a systematic review
    Lewis, Abigail E.
    Weiskopf, Nicole
    Abrams, Zachary B.
    Foraker, Randi
    Lai, Albert M.
    Payne, Philip R. O.
    Gupta, Aditi
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (10) : 1730 - 1740
  • [34] Conversion and Data Quality Assessment of Electronic Health Record Data at a Korean Tertiary Teaching Hospital to a Common Data Model for Distributed Network Research
    Yoon, Dukyong
    Ahn, Eun Kyoung
    Park, Man Young
    Cho, Soo Yeon
    Ryan, Patrick
    Schuemie, Martijn J.
    Shin, Dahye
    Park, Hojun
    Park, Rae Woong
    HEALTHCARE INFORMATICS RESEARCH, 2016, 22 (01) : 54 - +
  • [35] Secondary EMR data for quality improvement and research: A comparison of manual and electronic data collection from an integrated critical care electronic medical record system
    Brundin-Mather, Rebecca
    Soo, Andrea
    Zuege, Danny J.
    Niven, Daniel J.
    Fiest, Kirsten
    Doig, Christopher J.
    Zygun, David
    Boyd, Jamie M.
    Leigh, Jeanna Parsons
    Bagshaw, Sean M.
    Stelfox, Henry T.
    JOURNAL OF CRITICAL CARE, 2018, 47 : 295 - 301
  • [36] Using primary care electronic health record data for comparative effectiveness research: experience of data quality assessment and preprocessing in The Netherlands
    Huang, Yunyu
    Voorham, Jaco
    Haaijer-Ruskamp, Flora M.
    JOURNAL OF COMPARATIVE EFFECTIVENESS RESEARCH, 2016, 5 (04) : 345 - 354
  • [37] Application of a Data Quality Framework to Ductal Carcinoma In Situ Using Electronic Health Record Data From the All of Us Research Program
    Berman, Lew
    Ostchega, Yechiam
    Giannini, John
    Anandan, Lakshmi Priya
    Clark, Emily
    Spotnitz, Matthew
    Sulieman, Lina
    Volynski, Michael
    Ramirez, Andrea
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [38] Application of a Data Quality Framework to Ductal Carcinoma In Situ Using Electronic Health Record Data From the All of Us Research Program
    Berman, Lew
    Ostchega, Yechiam
    Giannini, John
    Anandan, Lakshmi Priya
    Clark, Emily
    Spotnitz, Matthew
    Sulieman, Lina
    Volynski, Michael
    Ramirez, Andrea
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [39] Application of a Data Quality Framework to Ductal Carcinoma In Situ Using Electronic Health Record Data From the All of Us Research Program
    Berman, Lew
    Ostchega, Yechiam
    Giannini, John
    Anandan, Lakshmi Priya
    Clark, Emily
    Spotnitz, Matthew
    Sulieman, Lina
    Volynski, Michael
    Ramirez, Andrea
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [40] Missing data in the electronic medical record era
    Perez, Siegfried
    Keijzers, Gerben
    EMERGENCY MEDICINE AUSTRALASIA, 2012, 24 (04) : 465 - 465