An efficient method for mining high utility closed itemsets

Cited by: 56
Authors
Nguyen, Loan T. T. [1]
Vu, Vinh V. [2]
Lam, Mi T. H. [2]
Duong, Thuy T. M. [2]
Manh, Ly T. [2]
Nguyen, Thuy T. T. [2]
Vo, Bay [3]
Fujita, Hamido [3]
Affiliations
[1] Int Univ VNU HCMC, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Food Ind, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
Keywords
Data mining; High-utility pattern; High-utility closed pattern; Early pruning strategies; Backward checking; Forward checking; FREQUENT; ALGORITHMS; PATTERN; LATTICE;
DOI
10.1016/j.ins.2019.05.006
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Mining closed high utility itemsets (CHUIs) involves finding a representative set of HUIs that is usually smaller than that of HUIs but can generate the full HUIs without loss of information. Researchers have therefore shown interest in this problem, and many methods have been proposed for mining CHUIs effectively, of which CHUI-Miner and EFIM-Closed are the two most efficient algorithms. However, these face performance issues when mining CHUIs from sparse datasets. In this paper, we propose a method for the effective mining of CHUIs in both dense and sparse datasets. We first modify the compact utility list structure in the HMiner algorithm to reduce the mining time, and then develop backward and forward checking methods using the most recently explored CHUIs and combine this with candidate building for the next levels. Finally, we apply pruning strategies to reduce the search space for the generation of CHUIs. Our experimental results show that the proposed algorithm, called HMiner-Closed, is more efficient than the state-of-the-art algorithms for both dense and sparse datasets. (C) 2019 Elsevier Inc. All rights reserved.
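To make the notion of a closed high utility itemset concrete, here is a minimal brute-force sketch. It is not HMiner-Closed and uses none of the paper's utility-list structures, backward/forward checking, or pruning strategies. It assumes the common definitions: an itemset is high utility when its total utility reaches a threshold, and closed when no proper superset has the same support. The function names, the toy database, and the threshold `min_util` are hypothetical, chosen only for illustration.

```python
from itertools import combinations

def support(itemset, db):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in db if all(i in t for i in itemset))

def utility(itemset, db):
    """Sum of the itemset's item utilities over the transactions containing it."""
    return sum(sum(t[i] for i in itemset)
               for t in db if all(i in t for i in itemset))

def closed_high_utility_itemsets(db, min_util):
    """Enumerate all itemsets (exponential; toy data only) and keep the CHUIs."""
    items = sorted({i for t in db for i in t})
    candidates = [frozenset(c)
                  for k in range(1, len(items) + 1)
                  for c in combinations(items, k)]
    supp = {x: support(x, db) for x in candidates}
    result = {}
    for x in candidates:
        if supp[x] == 0:
            continue  # itemset never occurs
        u = utility(x, db)
        if u < min_util:
            continue  # not a high utility itemset
        # Closed: no proper superset occurs in exactly the same number of transactions.
        if any(x < y and supp[y] == supp[x] for y in candidates):
            continue
        result[x] = u
    return result

# Hypothetical toy database: each transaction maps item -> utility
# (quantity times unit profit, already multiplied out).
db = [
    {"a": 5, "b": 2, "c": 1},
    {"a": 10, "c": 6},
    {"b": 4, "c": 3},
]
chuis = closed_high_utility_itemsets(db, min_util=10)
for itemset, u in sorted(chuis.items(), key=lambda kv: sorted(kv[0])):
    print(sorted(itemset), u)
```

In this toy example `{a}` is high utility (utility 15) but not closed, because `{a, c}` occurs in the same two transactions; the sketch therefore reports only `{c}`, `{a, c}`, and `{b, c}`. Enumerating every itemset is exponential in the number of items, which is exactly the cost that the pruning and checking strategies described in the abstract aim to avoid.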
Pages: 78 - 99
Number of pages: 22
Related papers
50 in total
  • [21] ECUL-Miner: Efficiently mining high utility closed itemsets
    Zhai Yue
    Xu Qiyun
    Li Lin
    Wang Lijuan
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2337 - 2341
  • [22] Mining Closed+ High Utility Itemsets without Candidate Generation
    Wu, Cheng-Wei
    Fournier-Viger, Philippe
    Gu, Jia-Yuan
    Tseng, Vincent S.
    2015 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2015, : 187 - 194
  • [23] HIGH UTILITY ITEMSETS MINING
    Liu, Ying
    Li, Jianwei
    Liao, Wei-Keng
    Choudhary, Alok
    Shi, Yong
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2010, 9 (06) : 905 - 934
  • [24] Efficient algorithms for mining maximal high-utility itemsets
    Nguyen, Trinh D. D.
    Quoc-Bao Vu
    Nguyen, Loan T. T.
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 428 - 433
  • [25] Efficient Mining of Short Periodic High-Utility Itemsets
    Lin, Jerry Chun-Wei
    Zhang, Jiexiong
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Chen, Chien-Ming
    Su, Ja-Hwung
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 3083 - 3088
  • [26] Efficient Mining of Uncertain Data for High-Utility Itemsets
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Tseng, Vincent S.
    WEB-AGE INFORMATION MANAGEMENT, PT I, 2016, 9658 : 17 - 30
  • [27] Efficient mining of high utility itemsets from large datasets
    Erwin, Alva
    Gopalan, Raj P.
    Achuthan, N. R.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 554 - +
  • [28] An efficient algorithm for mining closed high utility itemsets over data streams with one dataset scan
    Meng Han
    Haodong Cheng
    Ni Zhang
    Xiaojuan Li
    Le Wang
    Knowledge and Information Systems, 2023, 65 : 207 - 240
  • [29] An efficient algorithm for mining closed high utility itemsets over data streams with one dataset scan
    Han, Meng
    Cheng, Haodong
    Zhang, Ni
    Li, Xiaojuan
    Wang, Le
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (01) : 207 - 240
  • [30] A Robust Technique for Closed Frequent and High Utility Itemsets Mining: Closed-FHUIM
    Ashraf, Muhammad Waheed
    Naeem, M. Asif
    Lee, Heejeong Jasmine
    IEEE ACCESS, 2024, 12 : 196517 - 196532