ANALYZING ESTABLISHMENT NONRESPONSE USING AN INTERPRETABLE REGRESSION TREE MODEL WITH LINKED ADMINISTRATIVE DATA

Cited by: 38
Authors
Phipps, Polly [1]
Toth, Daniell [1]
Affiliations
[1] US Bur Labor Stat, Off Survey Methods Res, Washington, DC 20212 USA
Source
ANNALS OF APPLIED STATISTICS | 2012 / Vol. 6 / No. 02
Keywords
Recursive partitioning; nonignorable nonresponse; propensity model; establishment survey; Classification and Regression Trees (CART); NONPARAMETRIC REGRESSION;
DOI
10.1214/11-AOAS521
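Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification code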
020208 ; 070103 ; 0714 ;
Abstract
To gain insight into how characteristics of an establishment are associated with nonresponse, a recursive partitioning algorithm is applied to the Occupational Employment Statistics May 2006 survey data to build a regression tree. The tree models an establishment's propensity to respond to the survey given certain establishment characteristics. It provides mutually exclusive cells based on the characteristics with homogeneous response propensities. This makes it easy to identify interpretable associations between the characteristic variables and an establishment's propensity to respond, something not easily done using a logistic regression propensity model. We test the model obtained using the May data against data from the November 2006 Occupational Employment Statistics survey. Testing the model on a disjoint set of establishment data with a very large sample size (n = 179,360) offers evidence that the regression tree model accurately describes the association between the establishment characteristics and the response propensity for the OES survey. The accuracy of this modeling approach is compared to that of logistic regression through simulation. This representation is then used along with frame-level administrative wage data linked to sample data to investigate the possibility of nonresponse bias. We show that without proper adjustments the nonresponse does pose a risk of bias and is possibly nonignorable.
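The recursive partitioning idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm or the OES data: it is a generic least-squares (CART-style) split search on synthetic records, and the feature names (`size_class`, `metro`), the response rates, and the parameters `min_cell` and `max_depth` are all assumptions made for illustration. Each leaf is a mutually exclusive cell whose mean response is read as the estimated response propensity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    propensity: float                  # mean response rate of the cell
    n: int                             # number of establishments in the cell
    feature: Optional[int] = None      # index of the splitting feature (None for a leaf)
    threshold: Optional[float] = None  # rows with value <= threshold go left
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def sse(ys):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not ys:
        return 0.0
    mean = sum(ys) / len(ys)
    return sum((v - mean) ** 2 for v in ys)


def grow(X, y, min_cell=50, max_depth=3, depth=0):
    """Recursively split on the (feature, threshold) pair that minimizes total SSE."""
    node = Node(propensity=sum(y) / len(y), n=len(y))
    if depth >= max_depth or len(y) < 2 * min_cell:
        return node
    best = None
    for j in range(len(X[0])):
        values = sorted({row[j] for row in X})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [yi for row, yi in zip(X, y) if row[j] <= t]
            right = [yi for row, yi in zip(X, y) if row[j] > t]
            if min(len(left), len(right)) < min_cell:
                continue
            score = sse(left) + sse(right)
            if best is None or score < best[0]:
                best = (score, j, t)
    # Stop when no admissible split reduces the squared error.
    if best is None or sse(y) - best[0] < 1e-9:
        return node
    _, j, t = best
    go_left = [row[j] <= t for row in X]
    node.feature, node.threshold = j, t
    node.left = grow([r for r, g in zip(X, go_left) if g],
                     [v for v, g in zip(y, go_left) if g],
                     min_cell, max_depth, depth + 1)
    node.right = grow([r for r, g in zip(X, go_left) if not g],
                      [v for v, g in zip(y, go_left) if not g],
                      min_cell, max_depth, depth + 1)
    return node


def predict(node, row):
    """Return the response propensity of the cell that `row` falls into."""
    while node.feature is not None:
        node = node.left if row[node.feature] <= node.threshold else node.right
    return node.propensity


# Synthetic sample (hypothetical): 100 establishments per (size_class, metro) cell.
# Assumed respondents per 100: 90 for small establishments, 60 or 50 for large ones.
X, y = [], []
for size_class in (1, 2, 3, 4):
    for metro in (0, 1):
        respondents = 90 if size_class <= 2 else (60 if metro == 0 else 50)
        for i in range(100):
            X.append((size_class, metro))
            y.append(1 if i < respondents else 0)

tree = grow(X, y)
print(tree.feature, tree.threshold, predict(tree, (4, 1)))
```

Reading the fitted tree from the root down gives the kind of interpretable statement the abstract refers to, for example "establishments above a given size class in metropolitan areas respond at a lower rate". A logistic regression would express the same pattern only through main-effect and interaction coefficients.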
Pages: 772 - 794
Number of pages: 23
Related papers
50 in total
  • [1] Assessing Nonresponse in a Longitudinal Establishment Survey Using Regression Trees
    Earp, Morgan
    Toth, Daniell
    Phipps, Polly
    Oslund, Charlotte
    JOURNAL OF OFFICIAL STATISTICS, 2018, 34 (02) : 463 - 481
  • [2] Modeling Nonresponse in Establishment Surveys: Using an Ensemble Tree Model to Create Nonresponse Propensity Scores and Detect Potential Bias in an Agricultural Survey
    Earp, Morgan
    Mitchell, Melissa
    McCarthy, Jaki
    Kreuter, Frauke
    JOURNAL OF OFFICIAL STATISTICS, 2014, 30 (04) : 701 - 719
  • [3] Evaluating the Utility of Linked Administrative Data for Nonresponse Bias Adjustment in a Piggyback Longitudinal Survey
    Buettner, Tobias J. M.
    Sakshaug, Joseph W.
    Vicari, Basha
    JOURNAL OF OFFICIAL STATISTICS, 2021, 37 (04) : 837 - 864
  • [4] A Bayesian model averaging approach to analyzing categorical data with nonignorable nonresponse
    Janicki, Ryan
    Malec, Donald
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 57 (01) : 600 - 614
  • [5] Dynamic Model Tree for Interpretable Data Stream Learning
    Haug, Johannes
    Broelemann, Klaus
    Kasneci, Gjergji
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022 : 2562 - 2574
  • [6] Using linked administrative and census data for migration research
    Ernsten, Annemarie
    McCollum, David
    Feng, Zhiqiang
    Everington, Dawn
    Huang, Zengyi
    POPULATION STUDIES-A JOURNAL OF DEMOGRAPHY, 2018, 72 (03) : 357 - 367
  • [7] Logistic regression model for analyzing extended haplotype data
    Wallenstein, S
    Hodge, SE
    Weston, A
    GENETIC EPIDEMIOLOGY, 1998, 15 (02) : 173 - 181
  • [8] Robust regression using probabilistically linked data
    Chambers, Ray L.
    Fabrizi, Enrico
    Ranalli, Maria Giovanna
    Salvati, Nicola
    Wang, Suojin
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (02)
  • [9] Analyzing Nonresponse in Longitudinal Surveys Using Bayesian Additive Regression Trees: A Nonparametric Event History Analysis
    Zinn, Sabine
    Gnambs, Timo
    SOCIAL SCIENCE COMPUTER REVIEW, 2022, 40 (03) : 678 - 699
  • [10] Using Administrative Data to Explore the Effect of Survey Nonresponse in the UK Employment Retention and Advancement Demonstration
    Dorsett, Richard
    Hendra, Richard
    Robins, Philip K.
    EVALUATION REVIEW, 2018, 42 (5-6) : 491 - 514