Statistical data integration in survey sampling: a review

被引:43
|
作者
Yang, Shu [1 ]
Kim, Jae Kwang [2 ]
机构
[1] North Carolina State Univ, Dept Stat, Raleigh, NC USA
[2] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
基金
美国国家科学基金会;
关键词
Generalizability; Meta-analysis; Missing at random; Transportability; PROPENSITY SCORE; COMBINING INFORMATION; MULTIPLE SURVEYS; GENERALIZING EVIDENCE; ROBUST ESTIMATION; CAUSAL INFERENCE; MISSING DATA; PROBABILITY; CALIBRATION; IMPUTATION;
D O I
10.1007/s42081-020-00093-w
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration provides a timely solution by leveraging multiple data sources to provide more robust and efficient inference than using any single data source alone. The technique for data integration varies depending on types of samples and available information to be combined. This article provides a systematic review of data integration techniques for combining probability samples, probability and non-probability samples, and probability and big data samples. We discuss a wide range of integration methods such as generalized least squares, calibration weighting, inverse probability weighting, mass imputation, and doubly robust methods. Finally, we highlight important questions for future research.
引用
收藏
页码:625 / 650
页数:26
相关论文
共 50 条
  • [31] Statistical mapping of count survey data
    Royle, JA
    Link, WA
    Sauer, JR
    PREDICTING SPECIES OCCURRENCES: ISSUES OF ACCURACY AND SCALE, 2002, : 625 - 638
  • [32] STATISTICAL HANDLING OF NATO SURVEY DATA
    CHURCHIL.E
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 1963, 21 (03) : 410 - &
  • [33] Foreword to the special issue on "Survey Methods for Statistical Data Integration and New Data Sources: tools and real data applications for official statistics"
    Ranalli, M. Giovanna
    Beaumont, Jean-Francois
    Bertarelli, Gaia
    Shlomo, Natalie
    METRON-INTERNATIONAL JOURNAL OF STATISTICS, 2024, 82 (01): : 1 - 3
  • [34] Foreword to the special issue on “Survey Methods for Statistical Data Integration and New Data Sources: tools and real data applications for official statistics”
    M. Giovanna Ranalli
    Jean-François Beaumont
    Gaia Bertarelli
    Natalie Shlomo
    METRON, 2024, 82 : 1 - 3
  • [35] Survey and Prospect: Data Integration Methodologies
    Wang S.
    Peng Y.-W.
    Lan H.
    Luo Q.-W.
    Peng Z.-Y.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (03): : 893 - 908
  • [36] A Survey of Privacy Preserving Data Integration
    Shelake, Vijay Maruti
    Shekokar, Narendra
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 59 - 70
  • [37] A survey of Deep Web data integration
    Liu, Wei
    Meng, Xiao-Feng
    Meng, Wei-Yi
    Jisuanji Xuebao/Chinese Journal of Computers, 2007, 30 (09): : 1475 - 1489
  • [38] Statistical Analysis with Soft Computation for Fuzzy Answering in Sampling Survey
    Wu, Berlin
    Tien-Liu, Tsung-Kuo
    Wu, Ching-Ling
    Lai, Wentsung
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (04) : 1295 - 1306
  • [39] Statistical sampling in the survey of events associated to low frequency observation
    Moscarella, F
    INDUSTRIE ALIMENTARI, 1996, 35 (350): : 819 - 820
  • [40] Entity level data integration by statistical methods
    Lenz, HJ
    SSDBM 2002: 15TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2003, : 3 - 4