Re-induction based mining for high utility item-sets

被引:0
|
作者
Mathur, Pushp S. [1 ]
Chand, Satish [1 ]
机构
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India
关键词
Datasets; Data mining; Knowledge discovery; Utility mining; Pattern recognition; EFFICIENT ALGORITHMS;
D O I
10.1007/s10489-024-05855-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The High Utility Itemset mining (HUIM) is an important research area in the field of data mining and knowledge discovery. HUIM aims to discover the high utility patterns from a given database, based on a utility threshold value, where the utility is a user-defined objective function. The existing HUIM algorithms fail to consider the actual behaviour of the occurrence of patterns in database. They consider all the patterns having the same utility value to be of equal importance. However, this may not always be the case, since some patterns may occur in localized clusters in the database while others can have a more uniform sequence of occurrence. The Frequent Itemset Mining (FIM) approaches also fail to address this problem since they are based on a support framework that considers only the frequency of occurrence of an itemset in the database. To address this research gap, this study introduces a novel concept of maintaining a count value of the itemsets, called re-induction count, in order to keep track of the relative occurrence of items in the database. A novel algorithm, named Ri-Miner, is proposed to mine itemsets based on both a minimum utility threshold and their re-induction count. The experimental results show that Ri-Miner outperforms existing methods by achieving a 15% improvement in execution time and a 10% reduction in memory usage. The proposed method can be useful in various applications that require capturing the underlying occurrence behaviour of the patterns the database, like market-basket analysis, healthcare, web stream analytics, etc.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Re-induction based mining for high utility item-setsRe-induction based mining for high utility item-setsP.S. Mathur and S. Chand
    Pushp S. Mathur
    Satish Chand
    Applied Intelligence, 2025, 55 (1)
  • [2] Protocol Keywords Extraction Method Based on Frequent Item-Sets Mining
    Li, Gaochao
    Qian, Qiang
    Wang, Zhonghua
    Zou, Xin
    Chen, Xunxun
    Wu, Xiao
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEM (ICISS 2018), 2018, : 53 - 58
  • [3] Ordering Policy Estimation for High Utility Item-Sets Considering Negative Item Values in Large Databases
    Agarwal, Reshu
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [4] High-utility pattern mining: A method for discovery of high-utility item sets
    Hu, Jianying
    Mojsilovic, Aleksandra
    PATTERN RECOGNITION, 2007, 40 (11) : 3317 - 3324
  • [5] A Classification Algorithm based on an Association Rule of Multiple Frequent Item-sets
    Liang, ZhiHeng
    HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 278 - 282
  • [6] Method for Mining Frequent Item Sets Considering Average Utility
    Agarwal, Reshu
    Gautam, Arti
    Saksena, Ayush Kumar
    Rai, Amrita
    Karatangi, Shylaja VinayKumar
    2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, : 275 - 278
  • [7] Bayesian classifier based on frequent item sets mining
    Xu, Junming
    Jiang, Yuan
    Zhou, Zhihua
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2007, 44 (08): : 1293 - 1300
  • [8] A new mining algorithm based on frequent item sets
    Wen Yun
    FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 410 - 413
  • [9] Pairwise Preference over Mixed-Type Item-sets Based Bayesian Personalized Ranking for Collaborative Filtering
    Gao, Shan
    Guo, Guibing
    Lin, Yusong
    Zhang, Xingjin
    Liu, Yongpeng
    Wang, Zongmin
    2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 30 - 37
  • [10] A guided FP-Growth algorithm for mining multitude-targeted item-sets and class association rules in imbalanced data
    Shabtay, Lior
    Fournier-Viger, Philippe
    Yaari, Rami
    Dattner, Itai
    INFORMATION SCIENCES, 2021, 553 (553) : 353 - 375