Swapping-based Data Sanitization Method for Hiding Sensitive Frequent Itemset in Transaction Database

被引:0
|
作者
Gunawan, Dedi [1 ]
Nugroho, Yusuf Sulistyo [1 ]
Maryam [1 ]
机构
[1] Univ Muhammadiyah Surakarta, Informat Engn Dept, Surakarta, Indonesia
关键词
Transaction database; data sanitization; data mining; sensitive frequent itemset; swapping-based method; FAST ALGORITHMS; PRIVACY;
D O I
10.14569/IJACSA.2021.0121179
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sharing a transaction database with other parties for exploring valuable information becomes more recognized by business institutions, i.e., retails and supermarkets. It offers various benefits for the institutions, such as finding customer shopping behavior and frequently bought items, known as frequent itemsets. Due to the importance of such information, some institutions may consider certain frequent itemsets as sensitive information that should be kept private. Therefore, prior to handling a database, the institutions should consider privacy preserving data mining (PPDM) techniques for preventing sensitive information breaches. Presently, several PPDM methods, such as item suppression-based methods and item insertion-based methods have been developed. Unfortunately, the methods result in significant changes to the database and induce some side effects such as hiding failure, significant data dissimilarity, misses cost, and artificial frequent itemset occurrence. In this paper, we propose a swapping-based data sanitization method that can hide the sensitive frequent itemset while at the same time it can minimize the side effects of the data sanitization process. Experimental results indicate that the proposed method outperforms existing methods in terms of minimizing the side effects.
引用
收藏
页码:693 / 701
页数:9
相关论文
共 50 条
  • [41] HFIM: a Spark-based hybrid frequent itemset mining algorithm for big data processing
    Krishan Kumar Sethi
    Dharavath Ramesh
    The Journal of Supercomputing, 2017, 73 : 3652 - 3668
  • [42] Session-based continuous query protection method for database sensitive data
    Lu, Xiaofeng
    Liang, Chen
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 194 - 194
  • [43] A Novel Nodesets-Based Frequent Itemset Mining Algorithm for Big Data using MapReduce
    Sivaiah, Borra
    Rao, Ramisetty Rajeswara
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (09) : 1051 - 1058
  • [44] An Ontology based Frequent Itemset Method to Support Research Proposal Grouping for Research Project Selection
    Xu, Wei
    Xu, Yuzhi
    Ma, Jian
    PROCEEDINGS OF THE 46TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2013, : 1174 - 1182
  • [45] Search method of time sensitive frequent itemsets in data streams
    Park, Tae-Su
    Lee, Ju-Hong
    Park, Sang-Ho
    Choi, Bumghi
    Kim, Deok-Hwan
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2006, 4225 : 511 - 518
  • [46] Minable Data Publication Based on Sensitive Association Rule Hiding
    Yang, Fan
    Lei, Xinyu
    Le, Junqing
    Mu, Nankun
    Liao, Xiaofeng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (05): : 1247 - 1257
  • [47] A sensitive data aggregation scheme for body sensor networks based on data hiding
    Ren, Jiankang
    Wu, Guowei
    Yao, Lin
    PERSONAL AND UBIQUITOUS COMPUTING, 2013, 17 (07) : 1317 - 1329
  • [48] A sensitive data aggregation scheme for body sensor networks based on data hiding
    Jiankang Ren
    Guowei Wu
    Lin Yao
    Personal and Ubiquitous Computing, 2013, 17 : 1317 - 1329
  • [49] Reversible Data Hiding Method based on CSD Data Representation
    Li, Kuo-Hui
    Li, Chien-Sung
    Wang, Shuenn-Shyang
    Liao, Yi-Pin
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 734 - 745
  • [50] Reversibly hiding data using dual images scheme based on EMD data hiding method
    Chen, Yu
    Lin, Jiangyi
    Chang, Chin-Chen
    Hu, Yu-Chen
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 21 (04) : 583 - 592