Hiding sensitive itemsets with multiple objective optimization

被引:0
|
作者
Jerry Chun-Wei Lin
Yuyu Zhang
Binbin Zhang
Philippe Fournier-Viger
Youcef Djenouri
机构
[1] Harbin Institute of Technology (Shenzhen),School of Computer Science and Technology
[2] Western Norway University of Applied Sciences,Department of Computing, Mathematics, and Physics
[3] Shenzhen University Health Science Center,Department of Biochemistry and Molecular Biology
[4] Shenzhen University Health Science Center,Center for Anti
[5] Harbin Institute of Technology Shenzhen Graduate School,Aging and Regenerative Medicine
[6] IMADA,School of Natural Sciences and Humanities
[7] Southern Denmark University,undefined
来源
Soft Computing | 2019年 / 23卷
关键词
PPDM; Sanitization; Evolutionary computation; Pre-large concept; Pareto solutions;
D O I
暂无
中图分类号
学科分类号
摘要
Privacy-preserving data mining (PPDM) has become an important research topic, as it can hide sensitive information, while ensuring that information can still be extracted for decision making. While performing the sanitization progress for hiding the sensitive information, three side effects such as hiding failure, missing cost, and artificial cost happen at the same time. Several evolutionary algorithms were introduced to minimize those three side effects of PPDM using a single-objective function that generates one solution for sanitization. This paper presents a multiobjective algorithm (NSGA2DT) with two strategies for hiding sensitive information with transaction deletion based on the NSGA-II framework. To obtain better balance of side effects, the designed NSGA2DT takes database dissimilarity (Dis) as one more factor to achieve better performance in terms of four side effects. Moreover, instead of a single solution of the sanitization progress, the designed NSGA2DT provides more than one solutions than those of single-objective evolutionary algorithms, which shows flexibility to select the most appropriate transactions for deletion depending on user’s preference. A Fast SoRting strategy (FSR) and the pre-large concept are utilized, respectively, in this paper to find the optimized transactions for deletion and speed up the iterative process. Based on the developed NSGA2DT, the set of several Pareto solutions can be easily discovered, thus avoiding the problem of local optimization of single-objective approaches. Besides, the designed NSGA2DT does not require to set initial weights for evaluating the side effects, and thus, the results could not be seriously influenced by the predefined weights. Experimental results show that the proposed NSGA2DT provides satisfactory results with reduced side effects, compared to previous evolutionary approaches with single-objective function.
引用
收藏
页码:12779 / 12797
页数:18
相关论文
共 50 条
  • [21] Hiding Sensitive High Utility and Frequent Itemsets Based on Constrained Intersection Lattice
    Huynh Trieu Vy
    Le Quoc Hai
    Nguyen Thanh Long
    Truong Ngoc Chau
    Le Quoc Hieu
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2022, 22 (01) : 3 - 23
  • [22] A MODIFIED HIDING HIGH UTILITY ITEM FIRST ALGORITHM (HHUIF) WITH ITEM SELECTOR (MHIS) FOR HIDING SENSITIVE ITEMSETS
    Selvaraj, Rajalakshmi
    Kuthadi, Venu Madhav
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2013, 9 (12): : 4851 - 4862
  • [24] The GA-based algorithms for optimizing hiding sensitive itemsets through transaction deletion
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Yang, Kuo-Tung
    Wang, Leon Shyue-Liang
    APPLIED INTELLIGENCE, 2015, 42 (02) : 210 - 230
  • [25] The GA-based algorithms for optimizing hiding sensitive itemsets through transaction deletion
    Chun-Wei Lin
    Tzung-Pei Hong
    Kuo-Tung Yang
    Shyue-Liang Wang
    Applied Intelligence, 2015, 42 : 210 - 230
  • [26] A MaxMin approach for hiding frequent itemsets
    Moustakides, George V.
    Verykios, Vassihos S.
    DATA & KNOWLEDGE ENGINEERING, 2008, 65 (01) : 75 - 89
  • [27] An Efficient Method for Hiding High Utility Itemsets
    Bay Vo
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Vu, Vinh V.
    Minh Nguyen
    Bac Le
    ADVANCED METHODS AND TECHNOLOGIES FOR AGENT AND MULTI-AGENT SYSTEMS, 2013, 252 : 356 - 363
  • [28] Frequent Itemsets Hiding: A Performance Evaluation Framework
    Abul, Osman
    Gokce, Harun
    Sengez, Yagmur
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 666 - 671
  • [29] Fast algorithms for hiding sensitive high-utility itemsets in privacy-preserving utility mining
    Lin, Jerry Chun-Wei
    Wu, Tsu-Yang
    Fournier-Viger, Philippe
    Lin, Guo
    Zhan, Justin
    Voznak, Miroslav
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 55 : 269 - 284
  • [30] Hiding frequent patterns under multiple sensitive thresholds
    Kuo, Ya-Ping
    Lin, Pai-Yu
    Dai, Bi-Ru
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, 5181 : 5 - 18