MICF: An effective sanitization algorithm for hiding sensitive patterns on data mining

被引:14
|
作者
Li, Yu-Chiang
Yeh, Jieh-Shan
Chang, Chin-Chen
机构
[1] Natl Chung Cheng Univ, Dept Comp Sci & Informat Engn, Chiayi 62102, Taiwan
[2] Providence Univ, Dept Comp Sci & Informat Management, Taichung 433, Taiwan
[3] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung 40724, Taiwan
关键词
data mining; association rule; privacy-preserving; sensitive rule hiding;
D O I
10.1016/j.aei.2006.12.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining mechanisms have widely been applied in various businesses and manufacturing companies across many industry sectors. Sharing data or sharing mined rules has become a trend among business partnerships, as it is perceived to be a mutually benefit way of increasing productivity for all parties involved. Nevertheless, this has also increased the risk of unexpected information leaks when releasing data. To conceal restrictive itemsets (patterns) contained in the source database, a sanitization process transforms the source database into a released database that the counterpart cannot extract sensitive rules from. The transformed result also conceals non-restrictive information as an unwanted event, called a side effect or the "misses cost". The problem of finding an optimal sanitization method, which conceals all restrictive itemsets but minimizes the misses cost, is NP-hard. To address this challenging problem, this study proposes the maximum item conflict first (MICF) algorithm. Experimental results demonstrate that the proposed method is effective, has a low sanitization rate, and can generally achieve a significantly lower misses cost than those achieved by the MinFIA, MaxFIA, IGA and Algo2b methods in several real and artificial datasets. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:269 / 280
页数:12
相关论文
共 50 条
  • [21] An Improved Sanitization Algorithm in Privacy-Preserving Utility Mining
    Liu, Xuan
    Chen, Genlang
    Wen, Shiting
    Song, Guanghui
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [22] A fuzzy data mining algorithm for incremental mining of quantitative sequential patterns
    Subramanyam, RBV
    Goswami, A
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2005, 13 (06) : 633 - 652
  • [23] An Effective Algorithm for Simultaneously Mining Frequent Patterns and Association Rules
    Wei, Wei
    Yu, Songnian
    Guo, Qiang
    Ding, Wang
    Bian, Liya
    IEEE/SOLI'2008: PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS, VOLS 1 AND 2, 2008, : 190 - 195
  • [24] A Metaheuristic Algorithm for Hiding Sensitive Itemsets
    Lin, Jerry Chun-Wei
    Zhang, Yuyu
    Fournier-Viger, Philippe
    Djenouri, Youcef
    Zhang, Ji
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 492 - 498
  • [25] An Efficient PANN Algorithm for Effective Spatial Data Mining
    Saranya, N. Naga
    Megala, S.
    Revathi, P.
    Nadiammai, G. V.
    Krishnaveni, S.
    Hemalatha, M.
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 705 - +
  • [26] Dare to share: Protecting sensitive knowledge with data sanitization
    Amiri, Ali
    DECISION SUPPORT SYSTEMS, 2007, 43 (01) : 181 - 191
  • [27] A fuzzy data mining algorithm for finding sequential patterns
    Hu, YC
    Chen, RS
    Tzeng, GH
    Shieh, JH
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2003, 11 (02) : 173 - 193
  • [28] A Pattern Decomposition Algorithm for Data Mining of Frequent Patterns
    Zou, Qinghua
    Chu, Wesley
    Johnson, David
    Chiu, Henry
    Knowledge and Information Systems, 2002, 4 (04) : 466 - 482
  • [29] BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams
    Hassani, Marwan
    Toews, Daniel
    Cuzzocrea, Alfredo
    Seidl, Thomas
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2019, 8 (03) : 223 - 239
  • [30] BFSPMiner: an effective and efficient batch-free algorithm for mining sequential patterns over data streams
    Marwan Hassani
    Daniel Töws
    Alfredo Cuzzocrea
    Thomas Seidl
    International Journal of Data Science and Analytics, 2019, 8 : 223 - 239