Using prior information in privacy-protecting survey designs for categorical sensitive variables

被引:0
|
作者
Heiko Groenitz
机构
[1] Philipps-University Marburg,Department for Statistics (Faculty 02)
来源
Statistical Papers | 2015年 / 56卷
关键词
Randomized response; Nonrandomized response; Bayesian estimation; EM algorithm; Data augmentation;
D O I
暂无
中图分类号
学科分类号
摘要
To gather data on sensitive characteristics, such as annual income, tax evasion, insurance fraud or students’ cheating behavior, direct questioning is problematic, because it often results in answer refusal or untruthful responses. For this reason, several randomized response (RR) and nonrandomized response (NRR) survey designs, which increase cooperation by protecting the respondents’ privacy, have been proposed in the literature. In the first part of this paper, we present a Bayesian extension of a recently published, innovative NRR method for multichotomous sensitive variables. With this extension, the investigator is able to incorporate prior information on the parameter, e.g., based on a previous study, into the estimation and to improve the estimation precision. In particular, we derive different point and interval estimates by the EM algorithm and data augmentation. The performance of the considered estimators is evaluated in a simulation study. In the second part of this article, we show that for any RR or NRR model addressing the estimation of the distribution of a categorical sensitive characteristic, the design matrices of the model play the central role for the Bayes estimation whereas the concrete answer scheme is irrelevant. This observation enables us to widely generalize the calculations from the first part and to establish a common approach for Bayes inference in RR and NRR designs for categorical sensitive variables. This unified approach covers even multi-stage models and models that require more than one sample.
引用
收藏
页码:167 / 189
页数:22
相关论文
共 50 条
  • [31] Optimization method of unbiased soil sampling and layout using categorical auxiliary variables information
    Pan, Yuchun, 1600, Chinese Society of Agricultural Engineering (30):
  • [32] Bayesian Approach for Identifying Efficient Stated-Choice Survey Designs with Reduced Prior Information
    Cavalcante, Rinaldo
    Roorda, Matthew J.
    TRANSPORTATION RESEARCH RECORD, 2011, (2246) : 38 - 46
  • [33] Security: An Effective Technique to Protecting Sensitive Information using Quick Response Code
    Khedekar, Lokesh S.
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 1185 - 1188
  • [34] Spatial prediction of soil organic matter using terrain indices and categorical variables as auxiliary information
    Zhang, Shiwen
    Huang, Yuanfang
    Shen, Chongyang
    Ye, Huichun
    Du, Yichun
    GEODERMA, 2012, 171 : 35 - 43
  • [36] Memorizing Algorithm: Protecting user Privacy using Historical Information of Location-Based Services
    Truong, Quynh Chi
    Truong, Anh Tuan
    Dang, Tran Khanh
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2010, 2 (04) : 65 - 86
  • [38] Bayesian variable selection using partially observed categorical prior information in fine-mapping association studies
    Alenazi, Abdulaziz A.
    Cox, Angela
    Juarez, Miguel
    Lin, Wei-Yu
    Walters, Kevin
    GENETIC EPIDEMIOLOGY, 2019, 43 (06) : 690 - 703
  • [39] Complex survey design for estimation of total of sensitive variable using correlated scrambling variables
    Murtaza, Maryam
    Singh, Sarjinder
    Hussain, Zawar
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2025,
  • [40] Systematizing the State of Knowledge in Detecting Privacy Sensitive Information in Unstructured Texts using Machine Learning
    Loebner, Sascha
    Tesfay, Welderufael B.
    Bracamonte, Vanessa
    Nakamura, Toru
    2023 20TH ANNUAL INTERNATIONAL CONFERENCE ON PRIVACY, SECURITY AND TRUST, PST, 2023, : 267 - 273