An efficient method for mining high utility closed itemsets

被引:56
|
作者
Nguyen, Loan T. T. [1 ]
Vu, Vinh V. [2 ]
Lam, Mi T. H. [2 ]
Duong, Thuy T. M. [2 ]
Manh, Ly T. [2 ]
Nguyen, Thuy T. T. [2 ]
Vo, Bay [3 ]
Fujita, Hamido [3 ]
机构
[1] Int Univ VNU HCMC, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
[2] Ho Chi Minh City Univ Food Ind, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
关键词
Data mining; High-utility pattern; High-utility closed pattern; Early pruning strategies; Backward checking; Forward checking; FREQUENT; ALGORITHMS; PATTERN; LATTICE;
D O I
10.1016/j.ins.2019.05.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining closed high utility itemsets (CHUIs) involves finding a representative set of HUIs that is usually smaller than that of HUIs but can generate the full HUIs without loss of information. Researchers have therefore shown interest in this problem, and many methods have been proposed for mining CHUIs effectively, of which CHUI-Miner and EFIM-Closed are the two most efficient algorithms. However, these face performance issues when mining CHUIs from sparse datasets. In this paper, we propose a method for the effective mining of CHUIs in both dense and sparse datasets. We first modify the compact utility list structure in the HMiner algorithm to reduce the mining time, and then develop backward and forward checking methods using the most recently explored CHUIs and combine this with candidate building for the next levels. Finally, we apply pruning strategies to reduce the search space for the generation of CHUIs. Our experimental results show that the proposed algorithm, called HMiner-Closed, is more efficient than the state-of-the-art algorithms for both dense and sparse datasets. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:78 / 99
页数:22
相关论文
共 50 条
  • [31] Utility-Oriented Gradual Itemsets Mining Using High Utility Itemsets Mining
    Fongue, Audrey
    Lonlac, Jerry
    Tsopze, Norbert
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2023, 2023, 14148 : 107 - 113
  • [32] GRG: An efficient method for association rules mining on frequent closed itemsets
    Li, L
    Zhai, DH
    Jin, F
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2003, : 854 - 859
  • [33] An efficient fast algorithm for discovering closed+ high utility itemsets
    Sahoo, Jayakrushna
    Das, Ashok Kumar
    Goswami, A.
    APPLIED INTELLIGENCE, 2016, 45 (01) : 44 - 74
  • [34] Towards efficiently mining closed high utility itemsets from incremental databases
    Dam, Thu-Lan
    Ramampiaro, Heri
    Norvag, Kjetil
    Quang-Huy Duong
    KNOWLEDGE-BASED SYSTEMS, 2019, 165 : 13 - 29
  • [35] An efficient algorithm for mining frequent closed itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    Informatica (Slovenia), 2015, 39 (01): : 87 - 98
  • [36] An Efficient Algorithm for Mining Frequent Closed Itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01): : 87 - 98
  • [37] An efficient method for mining High-Utility itemsets from unstable negative profit databases
    Tung, N. T.
    Nguyen, Trinh D. D.
    Nguyen, Loan T. T.
    Vo, Bay
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [38] FTKHUIM: A Fast and Efficient Method for Mining Top-K High-Utility Itemsets
    Vu, Vinh V.
    Lam, Mi T. H.
    Duong, Thuy T. M.
    Manh, Ly T.
    Nguyen, Thuy T. T.
    Nguyen, Le V.
    Yun, Unil
    Snasel, Vaclav
    Vo, Bay
    IEEE ACCESS, 2023, 11 : 104789 - 104805
  • [39] An efficient fast algorithm for discovering closed+ high utility itemsets
    Jayakrushna Sahoo
    Ashok Kumar Das
    A. Goswami
    Applied Intelligence, 2016, 45 : 44 - 74
  • [40] Mining summarization of high utility itemsets
    Zhang, Xiong
    Deng, Zhi-Hong
    KNOWLEDGE-BASED SYSTEMS, 2015, 84 : 67 - 77