Efficient mining top-k high utility itemsets in incremental databases based on threshold raising strategies and pre-large concept

被引:0
|
作者
Tung, N. T. [1 ,2 ,3 ]
Nguyen, Loan T. T. [2 ,4 ]
Nguyen, Trinh D. D. [3 ]
Huynh, Bao [3 ]
机构
[1] Univ Informat Technol, Fac Comp Sci, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
[3] HUTECH Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
[4] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam
关键词
Data mining; Incremental databases; Incremental threshold raising strategy; Pre-large; Rescan condition; Top-k high utility itemset; ALGORITHM; PATTERNS;
D O I
10.1016/j.knosys.2025.113273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High utility itemset mining (HUIM) is a sub-problems of frequent itemset mining (FIM) that has received a lot of interest from researchers. It is used to analyze user behavior and improve business efficiency. The top-k high utility itemsets mining (top-k HUIM) issue aims to explore the k-itemsets with the highest utility from the database to handle the difficulty of threshold selection. Top-k HUIM algorithms ignore the transactions continuously added to the database in a dynamic environment, resulting in inaccurate top-k HUI results. However, the current top-k HUIM algorithms in the incremental database require users to request mining manually, or else, have it automatically processed every time the incremental batch is scanned, which is very small compared to the original database. Re-mining when the data is not updated enough affects the results and consumes a lot of resources without obtain new valuable insights. This research presents a raising threshold strategy to take advantage of the original database's mining results combining the updated database strategies. Furthermore, the paper proposes definitions of top-k mining using pre-large concept, thresholds, conditions for re-mining and method to solve the problem of always mining. Combining the proposed techniques and strategies, a complete "PreTK" algorithm is proposed to solve the proposed issues. The experiments are deployed to compare the algorithm's performance on diverse databases with baseline algorithms. The results demonstrate that the proposed method outperforms the state-of-the-art algorithms and may provide results faster, even when remining is necessary.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Mining top-k high utility itemsets with effective threshold raising strategies
    Krishnamoorthy, Srikumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 148 - 165
  • [2] Mining high utility itemsets based on the pre-large concept
    Hong, T.-P. (tphong@nuk.edu.tw), 1600, Springer Science and Business Media Deutschland GmbH (20):
  • [3] An efficient algorithm for mining the top-k high utility itemsets, using novel threshold raising and pruning strategies
    Quang-Huy Duong
    Liao, Bo
    Fournier-Viger, Philippe
    Thu-Lan Dam
    KNOWLEDGE-BASED SYSTEMS, 2016, 104 : 106 - 122
  • [4] Efficient incremental high utility pattern mining based on pre-large concept
    Lee, Judae
    Yun, Unil
    Lee, Gangin
    Yoon, Eunchul
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 72 : 111 - 123
  • [5] Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising Strategies
    Singh, Kuldeep
    Biswas, Bhaskar
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)
  • [6] Top-k high utility pattern mining with effective threshold raising strategies
    Ryang, Heungmo
    Yun, Unil
    KNOWLEDGE-BASED SYSTEMS, 2015, 76 : 109 - 126
  • [7] Efficient Algorithms for Mining Top-K High Utility Itemsets
    Tseng, Vincent S.
    Wu, Cheng-Wei
    Fournier-Viger, Philippe
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 54 - 67
  • [8] List-based mining top-k average-utility itemsets with effective pruning and threshold raising strategies
    Li, Zelin
    Li, Gufeng
    Zhao, Le
    Shang, Tao
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25678 - 25696
  • [9] List-based mining top-k average-utility itemsets with effective pruning and threshold raising strategies
    Zelin Li
    Gufeng Li
    Le Zhao
    Tao Shang
    Applied Intelligence, 2023, 53 : 25678 - 25696
  • [10] Mining Top-k Regular High-Utility Itemsets in Transactional Databases
    Kumari, P. Lalitha
    Sanjeevi, S. G.
    Rao, T. V. Madhusudhana
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2019, 15 (01) : 58 - 79