Re-induction based mining for high utility item-sets

Cited by: 0
Authors
Mathur, Pushp S. [1 ]
Chand, Satish [1 ]
Affiliations
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India
Keywords
Datasets; Data mining; Knowledge discovery; Utility mining; Pattern recognition; Efficient algorithms
DOI
10.1007/s10489-024-05855-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
High Utility Itemset Mining (HUIM) is an important research area in data mining and knowledge discovery. HUIM aims to discover high utility patterns in a given database, based on a utility threshold, where utility is a user-defined objective function. Existing HUIM algorithms fail to consider the actual occurrence behaviour of patterns in the database: they treat all patterns having the same utility value as equally important. However, this may not always be the case, since some patterns may occur in localized clusters in the database while others occur more uniformly. Frequent Itemset Mining (FIM) approaches also fail to address this problem, since they are based on a support framework that considers only the frequency of occurrence of an itemset in the database. To address this research gap, this study introduces a novel concept of maintaining a count value for itemsets, called the re-induction count, to keep track of the relative occurrence of items in the database. A novel algorithm, named Ri-Miner, is proposed to mine itemsets based on both a minimum utility threshold and their re-induction count. The experimental results show that Ri-Miner outperforms existing methods by achieving a 15% improvement in execution time and a 10% reduction in memory usage. The proposed method can be useful in various applications that require capturing the underlying occurrence behaviour of patterns in the database, such as market-basket analysis, healthcare, and web stream analytics.
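The idea of filtering itemsets by both utility and occurrence behaviour can be illustrated with a small sketch. The abstract does not define the re-induction count, so the `reappearance_count` function below is an assumed stand-in (the number of times an itemset reappears after being absent from at least one transaction), and the toy database, the `unit_profit` table, and the brute-force enumeration in `mine` are all hypothetical. This is not the Ri-Miner algorithm, only an illustration of mining under two thresholds.

```python
from itertools import combinations

# Toy database (made-up data): each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "b": 1},
    {"c": 3},
    {"a": 1, "b": 1, "c": 1},
    {"c": 2},
    {"a": 3, "b": 1},
]
# Hypothetical per-unit profit of each item (the user-defined utility).
unit_profit = {"a": 5, "b": 2, "c": 1}


def utility(itemset, transaction):
    """Utility of an itemset in one transaction (0 if not fully contained)."""
    if not all(item in transaction for item in itemset):
        return 0
    return sum(transaction[item] * unit_profit[item] for item in itemset)


def total_utility(itemset):
    """Utility of an itemset summed over the whole database."""
    return sum(utility(itemset, t) for t in transactions)


def reappearance_count(itemset):
    """ASSUMED stand-in for the re-induction count: how many times the
    itemset reappears after being absent from at least one transaction."""
    count = 0
    seen = False              # has the itemset occurred at all so far?
    previous_present = False  # did it occur in the previous transaction?
    for t in transactions:
        present = all(item in t for item in itemset)
        if present and seen and not previous_present:
            count += 1
        seen = seen or present
        previous_present = present
    return count


def mine(min_utility, min_reappearance):
    """Brute-force enumeration of itemsets that meet both thresholds."""
    items = sorted(unit_profit)
    results = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            u = total_utility(itemset)
            r = reappearance_count(itemset)
            if u >= min_utility and r >= min_reappearance:
                results[itemset] = (u, r)
    return results


print(mine(min_utility=30, min_reappearance=1))
```

In this toy data, the item `c` occurs only in one contiguous cluster of transactions, so its assumed count is 0, while `{a, b}` recurs after gaps. An approach based on utility alone or support alone would not distinguish these two occurrence patterns.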
Pages: 17