Mining top-k high average-utility itemsets based on breadth-first search

被引:0
|
作者
Liu, Xuan [1 ]
Chen, Genlang [1 ]
Wu, Fangyu [2 ]
Wen, Shiting [1 ]
Zuo, Wanli [3 ]
机构
[1] NingboTech Univ, Sch Comp & Data Engn, Ningbo 315100, Zhejiang, Peoples R China
[2] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215000, Jiangsu, Peoples R China
[3] Ningbo Univ, Sch Mech Engn & Mech, Ningbo 315211, Zhejiang, Peoples R China
关键词
Top-k high average-utility itemsets; Breadth-first search; High average-utility itemset; Data mining; EFFICIENT ALGORITHM; PATTERNS; STREAM;
D O I
10.1007/s10489-023-05076-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High average-utility itemset mining is a subfield of data mining that has extensive practical applications. However, it is difficult for users to determine a proper minimum threshold because they cannot accurately predict the number of patterns mined at a given threshold. To address this issue, top-k high average-utility itemset mining has been proposed where k is the number of high average-utility itemsets to be mined. In this paper, we design an effective algorithm (named ETAUIM) for finding top-k high average-utility itemsets. ETAUIM employs a breadth-first search strategy to efficiently explore the search space, and it utilizes a tighter upper bound instead of the average-utility upper bound to limit the search space. Additionally, ETAUIM removes irrelevant items during the mining process and utilizes an early abandoning strategy to terminate unnecessary join operations in advance. To evaluate the proposed algorithm, extensive experiments were conducted on six sparse datasets and two dense datasets. Four state-of-the-art algorithms were used for comparison. The experimental results show that ETAUIM has excellent performance and scalability. Moreover, ETAUIM always performs better for sparse datasets.
引用
收藏
页码:29319 / 29337
页数:19
相关论文
共 50 条
  • [1] Mining top-k high average-utility itemsets based on breadth-first search
    Xuan Liu
    Genlang Chen
    Fangyu Wu
    Shiting Wen
    Wanli Zuo
    Applied Intelligence, 2023, 53 : 29319 - 29337
  • [2] Top-k high average-utility itemsets mining with effective pruning strategies
    Ronghui Wu
    Zhan He
    Applied Intelligence, 2018, 48 : 3429 - 3445
  • [3] Top-k high average-utility itemsets mining with effective pruning strategies
    Wu, Ronghui
    He, Zhan
    APPLIED INTELLIGENCE, 2018, 48 (10) : 3429 - 3445
  • [4] Mining High Average-Utility Itemsets
    Hong, Tzung-Pei
    Lee, Cho-Han
    Wang, Shyue-Liang
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2526 - +
  • [5] List-based mining top-k average-utility itemsets with effective pruning and threshold raising strategies
    Li, Zelin
    Li, Gufeng
    Zhao, Le
    Shang, Tao
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25678 - 25696
  • [6] List-based mining top-k average-utility itemsets with effective pruning and threshold raising strategies
    Zelin Li
    Gufeng Li
    Le Zhao
    Tao Shang
    Applied Intelligence, 2023, 53 : 25678 - 25696
  • [7] Mining Top-k High Average-Utility Sequential Patterns for Resource Transformation
    Cao, Kai
    Duan, Yucong
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [8] High average-utility itemsets mining: a survey
    Kuldeep Singh
    Rajiv Kumar
    Bhaskar Biswas
    Applied Intelligence, 2022, 52 : 3901 - 3938
  • [9] High average-utility itemsets mining: a survey
    Singh, Kuldeep
    Kumar, Rajiv
    Biswas, Bhaskar
    APPLIED INTELLIGENCE, 2022, 52 (04) : 3901 - 3938
  • [10] Mining of top-k high utility itemsets with negative utility
    Sun, Rui
    Han, Meng
    Zhang, Chunyan
    Shen, Mingyao
    Du, Shiyu
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5637 - 5652