FTKHUIM: A Fast and Efficient Method for Mining Top-K High-Utility Itemsets

被引:8
|
作者
Vu, Vinh V. [1 ]
Lam, Mi T. H. [1 ]
Duong, Thuy T. M. [1 ]
Manh, Ly T. [1 ]
Nguyen, Thuy T. T. [1 ]
Nguyen, Le V. [1 ]
Yun, Unil [2 ]
Snasel, Vaclav [3 ]
Vo, Bay [4 ]
机构
[1] Ho Chi Minh City Univ Ind & Trade, Fac Informat Technol, Ho Chi Minh 700000, Vietnam
[2] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea
[3] VSB Tech Univ Ostrava, Fac Elect Engn & Comp Sci, Ostrava 70800, Czech Republic
[4] HUTECH Univ, Fac Informat Technol, Ho Chi Minh 700000, Vietnam
关键词
Knowledge data discovery; high-utility itemset; top-k HUIM; threshold-raising strategy; ALGORITHMS;
D O I
10.1109/ACCESS.2023.3314984
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-utility itemset mining (HUIM) is an important task in the field of knowledge data discovery. The large search space and huge number of HUIs are the consequences of applying HUIM algorithms with an inappropriate user-defined minimum utility threshold value. Determining a suitable threshold value to obtain the expected results is not a simple task and requires spending a lot of time. For common users, it is difficult to define a minimum threshold utility for exploring the right number of HUIs. On the one hand, if the threshold is set too high then the number of HUIs would not be enough. On the other hand, if the threshold is set too low, too many HUIs will be mined, thus wasting both time and memory. The top-k HUIs mining problem was proposed to solve this issue, and many effective algorithms have since been introduced by researchers. In this research, a novel approach, namely FTKHUIM (Fast top-k HUI Mining), is introduced to explore the top-k HUIs. One new threshold-raising strategy called RTU, a transaction utility (TU)-based threshold-raising strategy, has also been shown to rapidly increase the speed of top-k HUIM. The study also proposes a global structure to store utility values in the process of applying raising-threshold strategies to optimize these strategies. The results of experiments on various datasets prove that the FTKHUIM algorithm achieves better results with regard to both the time and search space needed.
引用
收藏
页码:104789 / 104805
页数:17
相关论文
共 50 条
  • [41] A General Method for mining high-Utility itemsets with correlated measures
    Nguyen Manh Hung
    Tung, N. T.
    Bay Vo
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2021, 5 (04) : 536 - 549
  • [42] Efficient mining of high-utility itemsets using multiple minimum utility thresholds
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Zhan, Justin
    KNOWLEDGE-BASED SYSTEMS, 2016, 113 : 100 - 115
  • [43] PHM: Mining Periodic High-Utility Itemsets
    Fournier-Viger, Philippe
    Lin, Jerry Chun-Wei
    Quang-Huy Duong
    Thu-Lan Dam
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2016, 9728 : 64 - 79
  • [44] FHN: Efficient Mining of High-Utility Itemsets with Negative Unit Profits
    Fournier-Viger, Philippe
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 16 - 29
  • [45] Efficiently mining uncertain high-utility itemsets
    Jerry Chun-Wei Lin
    Wensheng Gan
    Philippe Fournier-Viger
    Tzung-Pei Hong
    Vincent S. Tseng
    Soft Computing, 2017, 21 : 2801 - 2820
  • [46] Efficient mining of closed high-utility itemsets in dynamic and incremental databases
    Vlashejerdi, Mahnaz Naderi
    Daneshpour, Negin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
  • [47] Efficiently mining uncertain high-utility itemsets
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Tseng, Vincent S.
    SOFT COMPUTING, 2017, 21 (11) : 2801 - 2820
  • [48] Mining High-Utility Itemsets with Irregular Occurrence
    Laoviboon, Supachai
    Amphawan, Komate
    2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST), 2017, : 89 - 94
  • [49] Parallel Mining of Top-k High Utility Itemsets in Spark In-Memory Computing Architecture
    Lin, Chun-Han
    Wu, Cheng-Wei
    Huang, JianTao
    Tseng, Vincent S.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT II, 2019, 11440 : 253 - 265
  • [50] Efficient top-k high utility itemset mining on massive data
    Han, Xixian
    Liu, Xianmin
    Li, Jianzhong
    Gao, Hong
    INFORMATION SCIENCES, 2021, 557 : 382 - 406