Efficient multivariate data-oriented microaggregation

Authors
Josep Domingo-Ferrer
Antoni Martínez-Ballesté
Josep Maria Mateo-Sanz
Francesc Sebé
Affiliations
[1] Rovira i Virgili University of Tarragona, Department of Computer Engineering & Maths
[2] Rovira i Virgili University of Tarragona, Statistics Group
Source
The VLDB Journal | 2006 / Vol. 15
Keywords
Statistical databases; Privacy; Anonymity; Statistical disclosure control; Microaggregation; Microdata protection;
DOI
Not available
Abstract
Microaggregation is a family of methods for statistical disclosure control (SDC) of microdata (records on individuals and/or companies), that is, for masking microdata so that they can be released while preserving the privacy of the underlying individuals. The principle of microaggregation is to aggregate original database records into small groups prior to publication. Each group should contain at least k records to prevent disclosure of individual information, where k is a constant preset by the data protector. Recently, microaggregation has been shown to be useful for achieving k-anonymity, in addition to being a good masking method. Optimal microaggregation (with minimum within-groups variability loss) can be computed in polynomial time for univariate data. Unfortunately, for multivariate data it is an NP-hard problem. Several heuristic approaches to microaggregation have been proposed in the literature. Heuristics yielding groups of fixed size k tend to be more efficient, whereas data-oriented heuristics yielding variable group sizes tend to result in lower information loss. This paper presents new data-oriented heuristics which improve the trade-off between computational complexity and information loss and are thus usable for large datasets.
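The grouping principle described in the abstract can be illustrated with a minimal sketch. This is not one of the heuristics proposed in the paper: it is a deliberately naive fixed-size scheme that orders records by distance to the global centroid, cuts the ordering into groups of at least k records, and replaces each record by its group centroid. The function names (`microaggregate`, `sse`) and the sample data are hypothetical, and the within-groups sum of squared errors is used here as an assumed stand-in for the "within-groups variability loss" the abstract mentions.

```python
from typing import List, Sequence, Tuple

Record = Sequence[float]


def centroid(group: Sequence[Record]) -> List[float]:
    """Attribute-wise mean of a non-empty group of records."""
    n = len(group)
    return [sum(col) / n for col in zip(*group)]


def sq_dist(a: Record, b: Record) -> float:
    """Squared Euclidean distance between two records."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def microaggregate(records: Sequence[Record], k: int) -> Tuple[List[List[float]], List[List[int]]]:
    """Naive fixed-size microaggregation (illustrative assumption, not the paper's method).

    Returns the masked records and the groups as lists of record indices.
    Every group has between k and 2k-1 records.
    """
    if k < 1 or len(records) < k:
        raise ValueError("need k >= 1 and at least k records")
    # Naive ordering: sort record indices by distance to the global centroid.
    c = centroid(records)
    order = sorted(range(len(records)), key=lambda i: sq_dist(records[i], c))
    # Cut the ordering into chunks of k; leftovers join the last group.
    n_groups = len(records) // k
    groups = [order[g * k:(g + 1) * k] for g in range(n_groups)]
    groups[-1].extend(order[n_groups * k:])
    # Replace every record by the centroid of its group.
    masked: List[List[float]] = [[] for _ in records]
    for g in groups:
        cen = centroid([records[i] for i in g])
        for i in g:
            masked[i] = list(cen)
    return masked, groups


def sse(records: Sequence[Record], masked: Sequence[Record]) -> float:
    """Within-groups sum of squared errors (lower means less information loss)."""
    return sum(sq_dist(r, m) for r, m in zip(records, masked))


# Hypothetical sample data: seven two-attribute records, k = 3.
data = [[1.0, 2.0], [1.2, 1.9], [5.0, 5.1], [5.2, 4.9],
        [9.0, 9.1], [8.8, 9.3], [9.1, 8.9]]
masked, groups = microaggregate(data, 3)
loss = sse(data, masked)
```

Because the ordering ignores the cluster structure of the data, this scheme generally yields a higher SSE than data-oriented heuristics, which is the trade-off the abstract refers to.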
Pages: 355-369
Page count: 14