Statistical disclosure control methods for census frequency tables

Cited by: 16
Authors
Shlomo, Natalie [1]
Affiliations
[1] Univ Southampton, Stat Sci Res Inst, Southampton SO17 1BJ, Hants, England
[2] Hebrew Univ Jerusalem, Dept Stat, IL-91905 Jerusalem, Israel
Keywords
disclosure risk measures; data utility measures; R-U confidentiality map
DOI
10.1111/j.1751-5823.2007.00010.x
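Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code
020208 ; 070103 ; 0714 ;
Abstract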
This paper provides a review of common statistical disclosure control (SDC) methods implemented at statistical agencies for standard tabular outputs containing whole population counts from a census (either enumerated or based on a register). These methods include record swapping on the microdata prior to its tabulation and rounding of entries in the tables after they are produced. The approach for assessing SDC methods is based on a disclosure risk-data utility framework and the need to find a balance between managing disclosure risk while maximizing the amount of information that can be released to users and ensuring high quality outputs. To carry out the analysis, quantitative measures of disclosure risk and data utility are defined and methods compared. Conclusions from the analysis show that record swapping as a sole SDC method leaves high probabilities of disclosure risk. Targeted record swapping lowers the disclosure risk, but there is more distortion of distributions. Small cell adjustments (rounding) give protection to census tables by eliminating small cells but only one set of variables and geographies can be disseminated in order to avoid disclosure by differencing nested tables. Full random rounding offers more protection against disclosure by differencing, but margins are typically rounded separately from the internal cells and tables are not additive. Rounding procedures protect against the perception of disclosure risk compared to record swapping since no small cells appear in the tables. Combining rounding with record swapping raises the level of protection but increases the loss of utility to census tabular outputs. For some statistical analysis, the combination of record swapping and rounding balances to some degree opposing effects that the methods have on the utility of the tables.
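The abstract notes that under full random rounding the margins are typically rounded separately from the internal cells, so tables are no longer additive. A minimal Python sketch of that effect follows, assuming unbiased random rounding to base 3. The base, the function names `random_round` and `round_table`, and the example counts are illustrative assumptions, not the procedure evaluated in the paper.

```python
import random


def random_round(count, base=3, rng=random):
    """Unbiased random rounding of a non-negative count to a multiple of `base`.

    Assumed rule: round up with probability remainder/base, otherwise round
    down, so the expected value of the rounded count equals the true count.
    """
    remainder = count % base
    if remainder == 0:
        return count  # multiples of the base (including zero) are unchanged
    lower = count - remainder
    if rng.random() < remainder / base:
        return lower + base
    return lower


def round_table(cells, base=3, seed=0):
    """Round internal cells and the margin (total) independently."""
    rng = random.Random(seed)
    rounded_cells = [random_round(c, base, rng) for c in cells]
    # The margin is rounded from the TRUE total, not from the rounded cells,
    # so it need not equal the sum of the rounded cells.
    rounded_total = random_round(sum(cells), base, rng)
    return rounded_cells, rounded_total


# Hypothetical one-way frequency table with several small cells.
cells = [1, 2, 4, 7, 0, 11]
rounded_cells, rounded_total = round_table(cells)

print("true cells:   ", cells, "total", sum(cells))
print("rounded cells:", rounded_cells, "sum", sum(rounded_cells))
print("rounded total:", rounded_total)
```

Every published value is a multiple of the base and lies within one base unit of the true count, so counts of 1 and 2 never appear. Because the total is rounded on its own, it can differ from the sum of the rounded cells, which is the loss of additivity the abstract describes.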
Pages: 199 - 217
Number of pages: 19
Related papers
50 in total
  • [21] Finding ε and δ of Statistical Disclosure Control Systems
    Das, Saswat
    Zhu, Keyu
    Task, Christine
    Van Hentenryck, Pascal
    Fioretto, Ferdinando
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22013 - 22020
  • [22] Geographically intelligent disclosure control for flexible aggregation of census data
    Young, Caroline
    Martin, David
    Skinner, Chris
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2009, 23 (04) : 457 - 482
  • [23] Sequential Monte Carlo methods for statistical analysis of tables
    Chen, YG
    Diaconis, P
    Holmes, SR
    Liu, JS
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) : 109 - 120
  • [24] DISCLOSURE CONTROL AT THE LEVEL OF A NATIONAL STATISTICAL OFFICE
    Lenz, Hans L.
    ROMANIAN STATISTICAL REVIEW, 2008, (06) : 3 - 17
  • [25] A Bayesian model for disclosure control in statistical databases
    Canfora, Gerardo
    Cavallo, Bice
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (11) : 1187 - 1205
  • [26] Microaggregation heuristic applied to statistical disclosure control
    Fadel, Augusto Cesar
    Ochi, Luiz Satoru
    Moura Brito, Jose Andre de
    Semaan, Gustavo Silva
    INFORMATION SCIENCES, 2021, 548 : 37 - 55
  • [28] Microaggregation heuristic applied to statistical disclosure control
    Fadel, Augusto César
    Ochi, Luiz Satoru
    Brito, José André de Moura
    Semaan, Gustavo Silva
    Information Sciences, 2021, 548 : 37 - 55
  • [29] On the complexity of optimal microaggregation for statistical disclosure control
    Oganian, Anna
    Domingo-Ferrer, Josep
    Statistical Journal of the United Nations Economic Commission for Europe, 2001, 18 (04) : 345 - 353
  • [30] Statistical methods for some simple disclosure limitation rules
    Pannekoek, J
    STATISTICA NEERLANDICA, 1999, 53 (01) : 55 - 67