BAYESIAN FINITE POPULATION IMPUTATION FOR DATA FUSION

Times cited: 12
Authors
Reiter, Jerome P. [1 ]
Affiliations
[1] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
Funding
US National Science Foundation
Keywords
Confidentiality; disclosure; matching; multiple; sharing; synthetic; MULTIPLE IMPUTATION; FILE CONCATENATION; ADJUSTED WEIGHTS;
DOI
10.5705/ss.2010.140
Chinese Library Classification number
O21 [Probability and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
In data fusion, data owners seek to combine datasets with disjoint observations and distinct variables to estimate relationships among the variables. One approach is to concatenate the files, specify models relating the variables not jointly observed, and use the models to generate multiple imputations of the missing data. We show that the standard multiple imputation estimator of the sampling variance can have positive bias in such contexts. We present an approach for correcting this problem based on Bayesian finite population inference. We also present an approach for data fusion when some values are confidential and cannot be shared.
Pages: 795 - 811
Page count: 17
Related papers
50 in total
  • [1] BAYESIAN IMPUTATION FOR MISSING DATA
    Nads, Azman A.
    Polestico, Daisy Lou L.
    ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 79 : 83 - 104
  • [2] Multiple imputation for longitudinal data using Bayesian lasso imputation model
    Yamaguchi, Yusuke
    Yoshida, Satoshi
    Misumi, Toshihiro
    Maruo, Kazushi
    STATISTICS IN MEDICINE, 2022, 41 (06) : 1042 - 1058
  • [3] A Bayesian Approach for Imputation of Censored Survival Data
    Moghaddam, Shirin
    Newell, John
    Hinde, John
    STATS, 2022, 5 (01) : 89 - 107
  • [4] Bayesian Inference for a Finite Population Total Using Linked Data
    Briscolini, Dario
    Liseo, Brunero
    Tancredi, Andrea
    SOFT METHODS FOR DATA SCIENCE, 2017, 456 : 79 - 86
  • [5] Accounting for uncertainty due to data processing in virtual population analysis using Bayesian multiple imputation
    Carruthers, Thomas
    Kell, Laurence
    Palma, Carlos
    CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 2018, 75 (06) : 883 - 896
  • [6] A BAYESIAN BOOTSTRAP FOR A FINITE POPULATION
    LO, AY
    ANNALS OF STATISTICS, 1988, 16 (04) : 1684 - 1695
  • [7] Bayesian Multiscale Multiple Imputation With Implications for Data Confidentiality
    Holan, Scott H.
    Toth, Daniell
    Ferreira, Marco A. R.
    Karr, Alan F.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (490) : 564 - 577
  • [8] Visual analysis for panel data imputation with Bayesian network
    Yeon, Hanbyul
    Seo, Seongbum
    Son, Hyesook
    Jang, Yun
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02) : 1759 - 1782
  • [9] Imputation of complex biological data for Bayesian network analyses
    Howey, Richard
    Cordell, Heather J.
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 705 - 706