An efficient method for mining high utility closed itemsets

被引:56
|
作者
Nguyen, Loan T. T. [1 ]
Vu, Vinh V. [2 ]
Lam, Mi T. H. [2 ]
Duong, Thuy T. M. [2 ]
Manh, Ly T. [2 ]
Nguyen, Thuy T. T. [2 ]
Vo, Bay [3 ]
Fujita, Hamido [3 ]
机构
[1] Int Univ VNU HCMC, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Food Ind, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Data mining; High-utility pattern; High-utility closed pattern; Early pruning strategies; Backward checking; Forward checking; FREQUENT; ALGORITHMS; PATTERN; LATTICE;
D O I
10.1016/j.ins.2019.05.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining closed high utility itemsets (CHUIs) involves finding a representative set of HUIs that is usually smaller than that of HUIs but can generate the full HUIs without loss of information. Researchers have therefore shown interest in this problem, and many methods have been proposed for mining CHUIs effectively, of which CHUI-Miner and EFIM-Closed are the two most efficient algorithms. However, these face performance issues when mining CHUIs from sparse datasets. In this paper, we propose a method for the effective mining of CHUIs in both dense and sparse datasets. We first modify the compact utility list structure in the HMiner algorithm to reduce the mining time, and then develop backward and forward checking methods using the most recently explored CHUIs and combine this with candidate building for the next levels. Finally, we apply pruning strategies to reduce the search space for the generation of CHUIs. Our experimental results show that the proposed algorithm, called HMiner-Closed, is more efficient than the state-of-the-art algorithms for both dense and sparse datasets. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:78 / 99
页数:22
相关论文
共 50 条
  • [41] Mining Local High Utility Itemsets
    Fournier-Viger, Philippe
    Zhang, Yimin
    Lin, Jerry Chun-Wei
    Fujita, Hamido
    Koh, Yun Sing
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 450 - 460
  • [42] Efficient Algorithms for Mining High Utility Itemsets from Transactional Databases
    Tseng, Vincent S.
    Shie, Bai-En
    Wu, Cheng-Wei
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (08) : 1772 - 1786
  • [43] An Efficient Algorithm for Mining High-Utility Itemsets with Discount Notion
    Bansal, Ruchita
    Dawar, Siddharth
    Goyal, Vikram
    BIG DATA ANALYTICS, BDA 2015, 2015, 9498 : 84 - 98
  • [44] Fast and Memory Efficient Mining of High Utility Itemsets in Data Streams
    Li, Hua-Fu
    Huang, Hsin-Yun
    Chen, Yi-Cheng
    Liu, Yu-Jiun
    Lee, Suh-Yin
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 881 - +
  • [45] Efficient Mining of High Average-Utility Itemsets with Multiple Thresholds
    Wu, Tsu-Yang
    Lin, Jerry Chun-Wei
    Ren, Shifeng
    ADVANCES IN INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PT I, 2018, 81 : 198 - 205
  • [46] An Efficient Three-Scan Approach for Mining High Utility Itemsets
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL LIFE AND ROBOTICS (AROB 17TH '12), 2012, : 414 - 417
  • [47] Efficient algorithms for mining high-utility itemsets in uncertain databases
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Tseng, Vincent S.
    KNOWLEDGE-BASED SYSTEMS, 2016, 96 : 171 - 187
  • [48] An efficient approach for mining association rules from high utility itemsets
    Sahoo, Jayakrushna
    Das, Ashok Kumar
    Goswami, A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) : 5754 - 5778
  • [49] Efficient mining of concise and informative representations of frequent high utility itemsets
    Tran, Thong
    Duong, Hai
    Truong, Tin
    Le, Bac
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [50] An efficient biobjective evolutionary algorithm for mining frequent and high utility itemsets
    Fang, Wei
    Li, Chongyang
    Zhang, Qiang
    Zhang, Xin
    Lin, Jerry Chun-Wei
    APPLIED SOFT COMPUTING, 2023, 140