H-FHAUI: Hiding frequent high average utility itemsets

被引:6
|
作者
Le, Bac [1 ,2 ]
Truong, Tin [3 ]
Duong, Hai [3 ]
Fournier-Viger, Philippe [4 ]
Fujita, Hamido [5 ]
机构
[1] Univ Sci, Fac Informat Technol, Dept Comp Sci, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] Dalat Univ, Dept Math & Comp Sci, Dalat City, Vietnam
[4] Harbin Inst Technol, Sch Human & Social Sci, Shenzhen, Peoples R China
[5] Iwate Prefectural Univ, Fac Software & Informat Sci, Takizawa, Iwate, Japan
关键词
Privacy-preserving utility mining; Pattern hiding; High average-utility itemset; Upper bound; Weak upper bound; Border approach; EFFICIENT ALGORITHM; SANITIZATION; PATTERNS;
D O I
10.1016/j.ins.2022.07.027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High average-utility itemset mining consists of analyzing a quantitative customer transactional database to identify high average-utility itemsets (HAUIs); that is sets of items that have a high average utility (e.g. profit). Although important information about customers' habits can be revealed by HAUIs, they can expose sensitive information. To address this concern, the problem of hiding frequent HAUIs (FHAUIs) is studied, which is to modify a transaction database to ensure that sensitive FHAUIs cannot be discovered. An algorithm is designed, named H-FHAUI, which relies on an extended border approach based on the support (occurrence frequency) and the average-utility of itemsets. Moreover, to hide all FHAUIs, H-FHAUI utilizes a novel extended lower border named Bd(E)(-) based on weak upper bounds on the average-utility to only hide a small number of FHAUIs. Then, all remaining FHAUIs are also hidden. Besides, H-FHAUI utilizes a novel weight-based strategy named ICS to choose items and transactions to be modified, a novel TIU-VIU structure to quickly update weak upper bounds, and a strategy named DUSWUB to quickly hide FHAUIs while ensuring that the total utility of D is preserved as much as possible. Experimental results show that H-FHAUI outperforms a baseline in terms of runtime, memory usage, and quality of the sanitized database. (C) 2022 Published by Elsevier Inc.
引用
收藏
页码:408 / 431
页数:24
相关论文
共 50 条
  • [1] CG-FHAUI: an efficient algorithm for simultaneously mining succinct pattern sets of frequent high average utility itemsets
    Duong, Hai
    Truong, Tin
    Le, Bac
    Fournier-Viger, Philippe
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (09) : 5239 - 5280
  • [2] A novel approach for hiding sensitive utility and frequent itemsets
    Liu, Xuan
    Xu, Feng
    Lv, Xin
    INTELLIGENT DATA ANALYSIS, 2018, 22 (06) : 1259 - 1278
  • [3] Hiding Sensitive High Utility and Frequent Itemsets Based on Constrained Intersection Lattice
    Huynh Trieu Vy
    Le Quoc Hai
    Nguyen Thanh Long
    Truong Ngoc Chau
    Le Quoc Hieu
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2022, 22 (01) : 3 - 23
  • [4] An Efficient Method for Hiding High Utility Itemsets
    Bay Vo
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Vu, Vinh V.
    Minh Nguyen
    Bac Le
    ADVANCED METHODS AND TECHNOLOGIES FOR AGENT AND MULTI-AGENT SYSTEMS, 2013, 252 : 356 - 363
  • [5] A MaxMin approach for hiding frequent itemsets
    Moustakides, George V.
    Verykios, Vassihos S.
    DATA & KNOWLEDGE ENGINEERING, 2008, 65 (01) : 75 - 89
  • [6] Mining High Average-Utility Itemsets
    Hong, Tzung-Pei
    Lee, Cho-Han
    Wang, Shyue-Liang
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2526 - +
  • [7] High average-utility itemsets mining: a survey
    Kuldeep Singh
    Rajiv Kumar
    Bhaskar Biswas
    Applied Intelligence, 2022, 52 : 3901 - 3938
  • [8] A New Method for Mining High Average Utility Itemsets
    Lu, Tien
    Vo, Bay
    Nguyen, Hien T.
    Hong, Tzung-Pei
    COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2014, 2014, 8838 : 33 - 42
  • [9] High average-utility itemsets mining: a survey
    Singh, Kuldeep
    Kumar, Rajiv
    Biswas, Bhaskar
    APPLIED INTELLIGENCE, 2022, 52 (04) : 3901 - 3938
  • [10] An efficient algorithm for hiding sensitive-high utility itemsets
    Trieu, Vy Huynh
    Quoc, Hai Le
    Ngoc, Chau Truong
    INTELLIGENT DATA ANALYSIS, 2020, 24 (04) : 831 - 845