Mining top-k high average-utility itemsets based on breadth-first search

被引：0

作者：

Liu, Xuan ^{[1
]}

Chen, Genlang ^{[1
]}

Wu, Fangyu ^{[2
]}

Wen, Shiting ^{[1
]}

Zuo, Wanli ^{[3
]}

机构：

[1] NingboTech Univ, Sch Comp & Data Engn, Ningbo 315100, Zhejiang, Peoples R China

[2] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Suzhou 215000, Jiangsu, Peoples R China

[3] Ningbo Univ, Sch Mech Engn & Mech, Ningbo 315211, Zhejiang, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 23期

关键词：

Top-k high average-utility itemsets; Breadth-first search; High average-utility itemset; Data mining; EFFICIENT ALGORITHM; PATTERNS; STREAM;

D O I：

10.1007/s10489-023-05076-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High average-utility itemset mining is a subfield of data mining that has extensive practical applications. However, it is difficult for users to determine a proper minimum threshold because they cannot accurately predict the number of patterns mined at a given threshold. To address this issue, top-k high average-utility itemset mining has been proposed where k is the number of high average-utility itemsets to be mined. In this paper, we design an effective algorithm (named ETAUIM) for finding top-k high average-utility itemsets. ETAUIM employs a breadth-first search strategy to efficiently explore the search space, and it utilizes a tighter upper bound instead of the average-utility upper bound to limit the search space. Additionally, ETAUIM removes irrelevant items during the mining process and utilizes an early abandoning strategy to terminate unnecessary join operations in advance. To evaluate the proposed algorithm, extensive experiments were conducted on six sparse datasets and two dense datasets. Four state-of-the-art algorithms were used for comparison. The experimental results show that ETAUIM has excellent performance and scalability. Moreover, ETAUIM always performs better for sparse datasets.

引用

页码：29319 / 29337

页数：19

共 50 条

[1] Mining top-k high average-utility itemsets based on breadth-first search
Xuan Liu
Genlang Chen
Fangyu Wu
Shiting Wen
Wanli Zuo
Applied Intelligence, 2023, 53 : 29319 - 29337
[2] Top-k high average-utility itemsets mining with effective pruning strategies
Ronghui Wu
Zhan He
Applied Intelligence, 2018, 48 : 3429 - 3445
[3] Top-k high average-utility itemsets mining with effective pruning strategies
Wu, Ronghui
He, Zhan
APPLIED INTELLIGENCE, 2018, 48 (10) : 3429 - 3445
[4] Mining High Average-Utility Itemsets
Hong, Tzung-Pei
Lee, Cho-Han
Wang, Shyue-Liang
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2526 - +
[5] List-based mining top-k average-utility itemsets with effective pruning and threshold raising strategies
Li, Zelin
Li, Gufeng
Zhao, Le
Shang, Tao
APPLIED INTELLIGENCE, 2023, 53 (21) : 25678 - 25696
[6] List-based mining top-k average-utility itemsets with effective pruning and threshold raising strategies
Zelin Li
Gufeng Li
Le Zhao
Tao Shang
Applied Intelligence, 2023, 53 : 25678 - 25696
[7] Mining Top-k High Average-Utility Sequential Patterns for Resource Transformation
Cao, Kai
Duan, Yucong
APPLIED SCIENCES-BASEL, 2023, 13 (22):
[8] High average-utility itemsets mining: a survey
Kuldeep Singh
Rajiv Kumar
Bhaskar Biswas
Applied Intelligence, 2022, 52 : 3901 - 3938
[9] High average-utility itemsets mining: a survey
Singh, Kuldeep
Kumar, Rajiv
Biswas, Bhaskar
APPLIED INTELLIGENCE, 2022, 52 (04) : 3901 - 3938
[10] Mining of top-k high utility itemsets with negative utility
Sun, Rui
Han, Meng
Zhang, Chunyan
Shen, Mingyao
Du, Shiyu
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5637 - 5652

← 1 2 3 4 5 →