Re-induction based mining for high utility item-sets

被引：0

作者：

Mathur, Pushp S. ^{[1
]}

Chand, Satish ^{[1
]}

机构：

[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India

来源：

APPLIED INTELLIGENCE | 2025年 / 55卷 / 01期

关键词：

Datasets; Data mining; Knowledge discovery; Utility mining; Pattern recognition; EFFICIENT ALGORITHMS;

D O I：

10.1007/s10489-024-05855-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The High Utility Itemset mining (HUIM) is an important research area in the field of data mining and knowledge discovery. HUIM aims to discover the high utility patterns from a given database, based on a utility threshold value, where the utility is a user-defined objective function. The existing HUIM algorithms fail to consider the actual behaviour of the occurrence of patterns in database. They consider all the patterns having the same utility value to be of equal importance. However, this may not always be the case, since some patterns may occur in localized clusters in the database while others can have a more uniform sequence of occurrence. The Frequent Itemset Mining (FIM) approaches also fail to address this problem since they are based on a support framework that considers only the frequency of occurrence of an itemset in the database. To address this research gap, this study introduces a novel concept of maintaining a count value of the itemsets, called re-induction count, in order to keep track of the relative occurrence of items in the database. A novel algorithm, named Ri-Miner, is proposed to mine itemsets based on both a minimum utility threshold and their re-induction count. The experimental results show that Ri-Miner outperforms existing methods by achieving a 15% improvement in execution time and a 10% reduction in memory usage. The proposed method can be useful in various applications that require capturing the underlying occurrence behaviour of the patterns the database, like market-basket analysis, healthcare, web stream analytics, etc.

引用

页数：17

共 50 条

[1] Re-induction based mining for high utility item-setsRe-induction based mining for high utility item-setsP.S. Mathur and S. Chand
Pushp S. Mathur
Satish Chand
Applied Intelligence, 2025, 55 (1)
[2] Protocol Keywords Extraction Method Based on Frequent Item-Sets Mining
Li, Gaochao
Qian, Qiang
Wang, Zhonghua
Zou, Xin
Chen, Xunxun
Wu, Xiao
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEM (ICISS 2018), 2018, : 53 - 58
[3] Ordering Policy Estimation for High Utility Item-Sets Considering Negative Item Values in Large Databases
Agarwal, Reshu
INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
[4] High-utility pattern mining: A method for discovery of high-utility item sets
Hu, Jianying
Mojsilovic, Aleksandra
PATTERN RECOGNITION, 2007, 40 (11) : 3317 - 3324
[5] A Classification Algorithm based on an Association Rule of Multiple Frequent Item-sets
Liang, ZhiHeng
HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 278 - 282
[6] Method for Mining Frequent Item Sets Considering Average Utility
Agarwal, Reshu
Gautam, Arti
Saksena, Ayush Kumar
Rai, Amrita
Karatangi, Shylaja VinayKumar
2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, : 275 - 278
[7] Bayesian classifier based on frequent item sets mining
Xu, Junming
Jiang, Yuan
Zhou, Zhihua
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2007, 44 (08): : 1293 - 1300
[8] A new mining algorithm based on frequent item sets
Wen Yun
FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 410 - 413
[9] Pairwise Preference over Mixed-Type Item-sets Based Bayesian Personalized Ranking for Collaborative Filtering
Gao, Shan
Guo, Guibing
Lin, Yusong
Zhang, Xingjin
Liu, Yongpeng
Wang, Zongmin
2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 30 - 37
[10] A guided FP-Growth algorithm for mining multitude-targeted item-sets and class association rules in imbalanced data
Shabtay, Lior
Fournier-Viger, Philippe
Yaari, Rami
Dattner, Itai
INFORMATION SCIENCES, 2021, 553 (553) : 353 - 375

← 1 2 3 4 5 →