A Novel Imputation Approach for Sharing Protected Public Health Data

Cited by: 5
Authors
Erdman, Elizabeth A. [1 ]
Young, Leonard D. [2 ]
Bernson, Dana L. [1 ]
Bauer, Cici [4 ]
Chui, Kenneth [3 ]
Stopka, Thomas J. [5 ,6 ]
Affiliations
[1] Commonwealth Massachusetts, Off Populat Hlth, Dept Publ Hlth, Boston, MA USA
[2] Commonwealth Massachusetts, Bur Hlth Profess Licensure, Dept Publ Hlth, Boston, MA USA
[3] Tufts Univ, Dept Publ Hlth & Community Med, Boston, MA USA
[4] Univ Texas Hlth Sci Ctr Houston, Dept Biostat & Data Sci, Houston, TX USA
[5] Tufts Univ, Tufts Clin & Translat Sci Inst, Medford, MA USA
[6] Tufts Univ, Dept Publ Hlth & Community Med, Medford, MA USA
Keywords
MISSING DATA;
DOI
10.2105/AJPH.2021.306432
Chinese Library Classification number
R1 [Preventive Medicine, Hygiene];
Discipline classification code
1004 ; 120402 ;
Abstract
Objectives. To develop an imputation method to produce estimates for suppressed values within a shared government administrative data set to facilitate accurate data sharing and statistical and spatial analyses. Methods. We developed an imputation approach that incorporated known features of suppressed Massachusetts surveillance data from 2011 to 2017 to predict missing values more precisely. Our methods for 35 de-identified opioid prescription data sets combined modified previous or next substitution followed by mean imputation and a count adjustment to estimate suppressed values before sharing. We modeled 4 methods and compared the results to baseline mean imputation. Results. We assessed performance by comparing root mean squared error (RMSE), mean absolute error (MAE), and proportional variance between imputed and suppressed values. Our method outperformed mean imputation; we retained 46% of the suppressed value's proportional variance with better precision (22% lower RMSE and 26% lower MAE) than simple mean imputation. Conclusions. Our easy-to-implement imputation technique largely overcomes the adverse effects of low count value suppression with superior results to simple mean imputation. This novel method is generalizable to researchers sharing protected public health surveillance data.
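The two error metrics named in the abstract, RMSE and MAE, can be illustrated with a minimal sketch. This is not the authors' method: it only scores a constant-fill baseline on toy data. The counts, the suppression rule (counts of 10 or fewer are hidden), and the fill value of 5.5 are all hypothetical assumptions chosen for illustration.

```python
import math


def rmse(truth, imputed):
    """Root mean squared error between true and imputed values."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(truth, imputed)) / len(truth))


def mae(truth, imputed):
    """Mean absolute error between true and imputed values."""
    return sum(abs(t - p) for t, p in zip(truth, imputed)) / len(truth)


def constant_impute(released, fill_value):
    """Baseline: replace every suppressed cell (None) with one constant."""
    return [fill_value if v is None else v for v in released]


# Hypothetical yearly counts for one area (toy data, not from the study).
truth = [12, 8, 5, 11, 3, 14, 9]

# Assumed suppression rule: counts of 10 or fewer are withheld before sharing.
released = [v if v > 10 else None for v in truth]

# Assumed fill value: midpoint of the hypothetical suppressed range 1..10.
filled = constant_impute(released, 5.5)

# Score only the cells that were suppressed.
hidden = [i for i, v in enumerate(released) if v is None]
true_hidden = [truth[i] for i in hidden]
imputed_hidden = [filled[i] for i in hidden]

print(rmse(true_hidden, imputed_hidden))  # 2.5
print(mae(true_hidden, imputed_hidden))   # 2.25
```

Any alternative imputation can be scored the same way by swapping out `constant_impute`; lower RMSE and MAE on the suppressed cells indicate estimates closer to the true values.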
Pages: 1830-1838
Number of pages: 9
Related papers
50 in total
  • [1] Virtual Aunting and Public Health Emergencies: A Novel Approach to Sharing Public Health Guidance
    Ottley, Amanda
    Stone, Courtney
    Henry, Marlyn
    Weber, Bridget
    Jackson, Tamaraleah
    Ondieki, Michael
    DISASTER MEDICINE AND PUBLIC HEALTH PREPAREDNESS, 2023, 17
  • [2] MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA
    Wang, Hao
    Reiter, Jerome P.
    ANNALS OF APPLIED STATISTICS, 2012, 6 (01): : 229 - 252
  • [3] Data sharing in public health emergencies
    Mietchen, D.
    INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2016, 53 : 35 - 36
  • [4] Sharing research data to improve public health
    Walport, Mark
    Brest, Paul
    LANCET, 2011, 377 (9765): : 537 - 539
  • [5] Sharing public health data saves lives
    Littler, K.
    INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2016, 53 : 24 - 25
  • [6] Sharing data for public health: where is the vision?
    Lopez, Alan D.
    BULLETIN OF THE WORLD HEALTH ORGANIZATION, 2010, 88 (06) : 467 - 467
  • [7] Sharing public health data: necessary and now
    Unknown
    LANCET, 2010, 375 (9730): : 1940 - 1940
  • [8] Interorganizational collaboration in public health data sharing
    Casey, Colleen
    Li, Jianling
    Berry, Michele
    JOURNAL OF HEALTH ORGANIZATION AND MANAGEMENT, 2016, 30 (06) : 855 - 871
  • [9] Empirical Comparison of Imputation Methods for Multivariate Missing Data in Public Health
    Pan, Steven
    Chen, Sixia
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2023, 20 (02)
  • [10] HonestChain: Consortium blockchain for protected data sharing in health information systems
    Soumya Purohit
    Prasad Calyam
    Mauro Lemus Alarcon
    Naga Ramya Bhamidipati
    Abu Mosa
    Khaled Salah
    Peer-to-Peer Networking and Applications, 2021, 14 : 3012 - 3028