Efficient multivariate data-oriented microaggregation

Authors
Josep Domingo-Ferrer
Antoni Martínez-Ballesté
Josep Maria Mateo-Sanz
Francesc Sebé
Affiliations
[1] Rovira i Virgili University of Tarragona, Department of Computer Engineering & Maths
[2] Rovira i Virgili University of Tarragona, Statistics Group
Source
The VLDB Journal | 2006 / Vol. 15
Keywords
Statistical databases; Privacy; Anonymity; Statistical disclosure control; Microaggregation; Microdata protection
DOI
Not available
Abstract
Microaggregation is a family of methods for statistical disclosure control (SDC) of microdata (records on individuals and/or companies), that is, for masking microdata so that they can be released while preserving the privacy of the underlying individuals. The principle of microaggregation is to aggregate original database records into small groups prior to publication. Each group should contain at least k records to prevent disclosure of individual information, where k is a constant value preset by the data protector. Recently, microaggregation has been shown to be useful for achieving k-anonymity, in addition to being a good masking method. Optimal microaggregation (with minimum within-groups variability loss) can be computed in polynomial time for univariate data. Unfortunately, for multivariate data it is an NP-hard problem. Several heuristic approaches to microaggregation have been proposed in the literature. Heuristics yielding groups with fixed size k tend to be more efficient, whereas data-oriented heuristics yielding variable group size tend to result in lower information loss. This paper presents new data-oriented heuristics which improve on the trade-off between computational complexity and information loss and are thus usable for large datasets.
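The grouping principle described in the abstract can be illustrated with a minimal Python sketch. It is not one of the heuristics proposed in the paper: it is a naive fixed-size scheme, assumed here purely for illustration, that orders records lexicographically, cuts them into groups of k, and replaces each record by its group centroid. The function names `microaggregate` and `information_loss`, and the use of SSE/SST as the loss measure, are assumptions of this sketch.

```python
from statistics import fmean


def microaggregate(records, k):
    """Naive fixed-size microaggregation (illustrative assumption only).

    records: list of equal-length numeric tuples.
    Returns (masked, groups): masked[i] is the centroid of the group that
    contains record i; groups is a list of lists of record indices.
    """
    n = len(records)
    if k < 1 or n < k:
        raise ValueError("need k >= 1 and at least k records")
    dims = len(records[0])

    # Naive ordering: lexicographic sort of the attribute tuples.
    order = sorted(range(n), key=lambda i: records[i])

    # Cut into runs of k; leftover records join the last group,
    # so every group has between k and 2k - 1 members.
    n_groups = n // k
    groups = [order[g * k:(g + 1) * k] for g in range(n_groups)]
    groups[-1].extend(order[n_groups * k:])

    # Replace every record by the centroid of its group.
    masked = [None] * n
    for group in groups:
        centroid = tuple(fmean(records[i][d] for i in group) for d in range(dims))
        for i in group:
            masked[i] = centroid
    return masked, groups


def information_loss(records, masked):
    """Within-group sum of squares (SSE) divided by total sum of squares (SST)."""
    dims = len(records[0])
    global_mean = tuple(fmean(r[d] for r in records) for d in range(dims))
    sse = sum((x - c) ** 2 for r, m in zip(records, masked) for x, c in zip(r, m))
    sst = sum((x - g) ** 2 for r in records for x, g in zip(r, global_mean))
    return sse / sst if sst > 0 else 0.0


if __name__ == "__main__":
    data = [(1.0, 2.0), (1.2, 1.8), (0.9, 2.1),
            (8.0, 9.0), (8.2, 9.1), (7.9, 8.8), (8.1, 9.3)]
    masked, groups = microaggregate(data, k=3)
    print([len(g) for g in groups])
    print(information_loss(data, masked))
```

Because every masked record is shared by at least k originals, the released data satisfy k-anonymity with respect to the aggregated attributes. A data-oriented heuristic would instead let group sizes vary with the local structure of the data, which is the trade-off the abstract refers to.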
Pages: 355-369
Page count: 14