Statistical data integration in survey sampling: a review

被引:43
|
作者
Yang, Shu [1 ]
Kim, Jae Kwang [2 ]
机构
[1] North Carolina State Univ, Dept Stat, Raleigh, NC USA
[2] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
基金
美国国家科学基金会;
关键词
Generalizability; Meta-analysis; Missing at random; Transportability; PROPENSITY SCORE; COMBINING INFORMATION; MULTIPLE SURVEYS; GENERALIZING EVIDENCE; ROBUST ESTIMATION; CAUSAL INFERENCE; MISSING DATA; PROBABILITY; CALIBRATION; IMPUTATION;
D O I
10.1007/s42081-020-00093-w
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration provides a timely solution by leveraging multiple data sources to provide more robust and efficient inference than using any single data source alone. The technique for data integration varies depending on types of samples and available information to be combined. This article provides a systematic review of data integration techniques for combining probability samples, probability and non-probability samples, and probability and big data samples. We discuss a wide range of integration methods such as generalized least squares, calibration weighting, inverse probability weighting, mass imputation, and doubly robust methods. Finally, we highlight important questions for future research.
引用
收藏
页码:625 / 650
页数:26
相关论文
共 50 条
  • [41] Linking Sensory Integration and Processing With Mental Health in Autism: A Retrospective Review of Survey Data
    Spielmann, Virginia
    Burke, Hannah K.
    McCulloch, Sarah
    Mason, Alex
    Lane, Shelly J.
    AMERICAN JOURNAL OF OCCUPATIONAL THERAPY, 2023, 77 (02):
  • [42] Foreword to the special issue on "Survey Methods for Statistical Data Integration and New Data Sources" (vol 81, pg 1, 2023)
    Ranalli, M. Giovanna
    Beaumont, Jean-Francois
    Bertarelli, Gaia
    Shlomo, Natalie
    METRON-INTERNATIONAL JOURNAL OF STATISTICS, 2024, 82 (01): : 131 - 131
  • [43] Survey data with sampling weights: Is there a "best" approach?
    Platt, Robert W.
    Harper, Sam B.
    ENVIRONMENTAL RESEARCH, 2013, 120 : 143 - 144
  • [44] Statistical Methods for the Analysis of Time–Location Sampling Data
    John M. Karon
    Cyprian Wejnert
    Journal of Urban Health, 2012, 89 : 565 - 586
  • [45] Fractional Imputation in Survey Sampling: A Comparative Review
    Yang, Shu
    Kim, Jae Kwang
    STATISTICAL SCIENCE, 2016, 31 (03) : 415 - 432
  • [46] Source Data Verification by Statistical Sampling: Issues in Implementation
    Andrew P. Grieve
    Drug information journal : DIJ / Drug Information Association, 2012, 46 : 368 - 377
  • [47] THE STATISTICAL EFFECTS OF INCOMPLETE SAMPLING OF COHERENT DATA SERIES
    PARKER, DE
    JOURNAL OF CLIMATOLOGY, 1984, 4 (04): : 445 - 449
  • [48] Source Data Verification by Statistical Sampling: Issues in Implementation
    Grieve, Andrew P.
    DRUG INFORMATION JOURNAL, 2012, 46 (03): : 368 - 377
  • [49] Sampling and Analyzing Statistical Data to Predict the Performance of MOOC
    Lisitsyna, Lubov S.
    Oreshin, Svyatoslav A.
    SMART EDUCATION AND E-LEARNING 2019, 2019, 144 : 77 - 85
  • [50] Numerical integration of harmonic functions with restricted sampling data
    Wu, ZM
    Hon, YC
    JOURNAL OF COMPLEXITY, 2001, 17 (04) : 898 - 909