MICF: An effective sanitization algorithm for hiding sensitive patterns on data mining

Cited by: 14
Authors
Li, Yu-Chiang
Yeh, Jieh-Shan
Chang, Chin-Chen
Affiliations
[1] Natl Chung Cheng Univ, Dept Comp Sci & Informat Engn, Chiayi 62102, Taiwan
[2] Providence Univ, Dept Comp Sci & Informat Management, Taichung 433, Taiwan
[3] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung 40724, Taiwan
Keywords
data mining; association rule; privacy-preserving; sensitive rule hiding
DOI
10.1016/j.aei.2006.12.003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data mining mechanisms have been widely applied in businesses and manufacturing companies across many industry sectors. Sharing data or mined rules has become a trend among business partners, as it is perceived to be a mutually beneficial way of increasing productivity for all parties involved. Nevertheless, this has also increased the risk of unexpected information leaks when releasing data. To conceal restrictive itemsets (patterns) contained in the source database, a sanitization process transforms the source database into a released database from which the counterpart cannot extract sensitive rules. The transformation also conceals some non-restrictive information as an unwanted side effect, called the "misses cost". The problem of finding an optimal sanitization method, one that conceals all restrictive itemsets while minimizing the misses cost, is NP-hard. To address this challenging problem, this study proposes the maximum item conflict first (MICF) algorithm. Experimental results demonstrate that the proposed method is effective, has a low sanitization rate, and generally achieves a significantly lower misses cost than the MinFIA, MaxFIA, IGA and Algo2b methods on several real and artificial datasets. (c) 2007 Elsevier Ltd. All rights reserved.
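The ideas in the abstract (sanitizing transactions so that restrictive itemsets can no longer be mined, and measuring the misses cost) can be illustrated with a small sketch. The abstract does not spell out the steps of MICF, so the code below is not the authors' algorithm. It is a simplified greedy heuristic that assumes "maximum item conflict" means removing, from each sensitive transaction, the item shared by the most restrictive itemsets in that transaction. It also assumes full hiding (support reduced to zero) and defines the misses cost as the fraction of non-restrictive frequent itemsets lost after sanitization. All function names and the toy database are hypothetical.

```python
from itertools import combinations


def support(db, itemset):
    """Number of transactions that contain every item of itemset."""
    return sum(1 for t in db if itemset <= t)


def frequent_itemsets(db, min_support, max_size=3):
    """Brute-force enumeration of frequent itemsets up to max_size items."""
    items = sorted(set().union(*db))
    found = set()
    for k in range(1, max_size + 1):
        for combo in combinations(items, k):
            candidate = frozenset(combo)
            if support(db, candidate) >= min_support:
                found.add(candidate)
    return found


def sanitize(db, restrictive):
    """Assumed heuristic: in each transaction, repeatedly delete the item
    that occurs in the most restrictive itemsets still contained in it."""
    released = []
    for transaction in db:
        t = set(transaction)
        while True:
            hits = [r for r in restrictive if r <= t]
            if not hits:
                break
            conflicts = {}
            for r in hits:
                for item in r:
                    conflicts[item] = conflicts.get(item, 0) + 1
            # Ties are broken alphabetically to keep the result deterministic.
            victim = max(sorted(conflicts), key=conflicts.get)
            t.discard(victim)
        released.append(frozenset(t))
    return released


def misses_cost(db, released, restrictive, min_support):
    """Fraction of non-restrictive frequent itemsets that are no longer
    frequent in the released database (assumed definition)."""
    before = frequent_itemsets(db, min_support)
    after = frequent_itemsets(released, min_support)
    non_restrictive = {s for s in before
                       if not any(r <= s for r in restrictive)}
    if not non_restrictive:
        return 0.0
    return len(non_restrictive - after) / len(non_restrictive)


# Toy source database and restrictive itemsets (illustrative only).
db = [frozenset("abc"), frozenset("abd"), frozenset("bcd"),
      frozenset("ac"), frozenset("abcd")]
restrictive = [frozenset("ab"), frozenset("bc")]

released = sanitize(db, restrictive)
cost = misses_cost(db, released, restrictive, min_support=2)
print(released, cost)
```

Hiding every occurrence of a restrictive itemset is the most conservative setting. A method that only pushes support below a mining threshold would modify fewer transactions and would typically lose fewer non-restrictive itemsets.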
Pages: 269-280
Page count: 12