A tutorial in assessing disclosure risk in microdata

被引:9
|
作者
Taylor, Leslie [1 ]
Zhou, Xiao-Hua [1 ,2 ,3 ]
Rise, Peter [1 ]
机构
[1] VA Puget Sound Hlth Care Syst, Hlth Serv Res & Dev, Seattle, WA 98108 USA
[2] Peking Univ, Int Ctr Math Res, Beijing 100871, Peoples R China
[3] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
关键词
confidentiality; data protection; disclosure risk; key variables; population unique; LOG-LINEAR MODELS; PROTECTING PRIVACY; IDENTIFICATION DISCLOSURE; K-ANONYMITY;
D O I
10.1002/sim.7667
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Statistical agencies are releasing statistical data to other agencies for research purposes or to inform public policy. Prior to data release, these agencies have a legal and ethical obligation to protect the confidentiality of individuals in the data. Agencies often release altered versions of the data, but there usually remains risks of disclosure. Many well-studied risk measures are available to assess risk; however, many agencies today continue to use subjective judgement, past experience, and ad hoc rules or checklists to assess disclosure risk. More recently, there has been a recognized demand for quantitative risk measures that provide a more objective criteria for data release. This tutorial provides an overview of the statistical disclosure control framework for microdata. We focus on the risk analysis stage within this framework by defining existing disclosure risk measures and how to estimate them with available software.
引用
收藏
页码:3693 / 3706
页数:14
相关论文
共 50 条