Re-induction based mining for high utility item-sets

被引：0

作者：

Mathur, Pushp S. ^{[1
]}

Chand, Satish ^{[1
]}

机构：

[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India

来源：

APPLIED INTELLIGENCE | 2025年 / 55卷 / 01期

关键词：

Datasets; Data mining; Knowledge discovery; Utility mining; Pattern recognition; EFFICIENT ALGORITHMS;

D O I：

10.1007/s10489-024-05855-7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The High Utility Itemset mining (HUIM) is an important research area in the field of data mining and knowledge discovery. HUIM aims to discover the high utility patterns from a given database, based on a utility threshold value, where the utility is a user-defined objective function. The existing HUIM algorithms fail to consider the actual behaviour of the occurrence of patterns in database. They consider all the patterns having the same utility value to be of equal importance. However, this may not always be the case, since some patterns may occur in localized clusters in the database while others can have a more uniform sequence of occurrence. The Frequent Itemset Mining (FIM) approaches also fail to address this problem since they are based on a support framework that considers only the frequency of occurrence of an itemset in the database. To address this research gap, this study introduces a novel concept of maintaining a count value of the itemsets, called re-induction count, in order to keep track of the relative occurrence of items in the database. A novel algorithm, named Ri-Miner, is proposed to mine itemsets based on both a minimum utility threshold and their re-induction count. The experimental results show that Ri-Miner outperforms existing methods by achieving a 15% improvement in execution time and a 10% reduction in memory usage. The proposed method can be useful in various applications that require capturing the underlying occurrence behaviour of the patterns the database, like market-basket analysis, healthcare, web stream analytics, etc.

引用

页数：17

共 50 条

[21] Avoidance of Model Re-Induction in SVM-based Feature Selection for Text Categorization
Kolcz, Aleksander
Chowdhury, Abdur
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 889 - 894
[22] Mining of High Average-Utility Patterns with Item-Level Thresholds
Lin, Jerry Chun-Wei
Li, Ting
Fournier-Viger, Philippe
Zhang, Ji
Guo, Xiangmin
JOURNAL OF INTERNET TECHNOLOGY, 2019, 20 (01): : 187 - 194
[23] EFFECT OF RE-INDUCTION TO HIGH-ALTITUDE ON LEFT-VENTRICULAR FUNCTION IN NORMAL MAN
BALASUBRAMANIAN, V
MATHEW, OP
TIWARI, SC
BEHL, A
HOON, RS
CLINICAL SCIENCE AND MOLECULAR MEDICINE, 1978, 55 (03): : P21 - P21
[24] CHANGES IN BODY-FLUID COMPARTMENTS ON RE-INDUCTION TO HIGH-ALTITUDE AND EFFECT OF DIURETICS
SINGH, MV
RAWAL, SB
TYAGI, AK
BHAGAT, MJK
PARSHAD, R
DIVEKAR, HM
INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 1988, 32 (01) : 36 - 40
[25] A Compact Data Structure Based Technique for Mining Frequent Closed Item Sets
Ahuja, Kamlesh
Mishra, Durgesh Kumar
Jain, Sarika
SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 503 - 508
[26] Frequent item sets based dimensionality reduction algorithm in data mining research
Bao Yong
Lu Jia-yuan
Wu Hui-zhong
Proceedings of 2005 Chinese Control and Decision Conference, Vols 1 and 2, 2005, : 1433 - 1435
[27] Item sets based graph mining algorithm and application in genetic regulatory networks
Song, Yongling
Chen, Su-Shing
2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 337 - +
[28] Frequent Item Sets and Association Rules Mining Algorithm Based on Floyd Algorithm
Zhang Lin
Zhang Jianli
JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (09) : 2574 - 2578
[29] Second Autologous Transplantation After Bortezomib-Based Re-induction Therapy: The UK Experience
Morris, T.
Kettle, P.
Drake, M.
Quinn, M.
Cavet, J.
Tighe, J.
Kazmi, M.
Streetly, M.
Ashcroft, J.
Cook, G.
Cook, M.
Cavenagh, J.
Oakervee, H.
Popat, R.
CLINICAL LYMPHOMA & MYELOMA, 2009, 9 : S40 - S41
[30] An efficient algorithm for mining high utility itemsets with negative item values in large databases
Chu, Chun-Jung
Tseng, Vincent S.
Liang, Tyne
APPLIED MATHEMATICS AND COMPUTATION, 2009, 215 (02) : 767 - 778

← 1 2 3 4 5 →