Pre-large based high utility pattern mining for transaction insertions in incremental database

Cited by: 16
Authors
Kim, Hyeonmo [1]
Lee, Chanhee [1]
Ryu, Taewoong [1]
Kim, Heonho [1]
Kim, Sinyoung [1]
Vo, Bay [2]
Lin, Jerry Chun-Wei [3]
Yun, Unil [1]
Affiliations
[1] Sejong Univ, Dept Comp Engn, Seoul, South Korea
[2] HUTECH Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Western Norway Univ Appl Sci, Dept Comp Sci Elect Engn & Math Sci, Bergen, Norway
Funding
National Research Foundation, Singapore;
Keywords
Data mining; Pattern mining; High-utility pattern; Pre-large; Incremental database; ALGORITHM; ITEMSETS;
DOI
10.1016/j.knosys.2023.110478
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
High utility pattern mining has been actively researched and applied in diverse applications because it processes a database by considering both the quantity and the importance of items. However, traditional high utility pattern mining methods are designed for static databases, so they cannot meet the needs of users who want to process dynamic environments. Although methods for incremental databases have been proposed, they are limited in that they rerun the mining process on the entire database, including already processed data, whenever new data are inserted. The pre-large concept is one technique for processing dynamic databases. With the pre-large technique, transaction insertions can be handled efficiently by reusing the patterns extracted in the previous mining process. In this paper, we propose a novel pre-large-based approach to discover high utility patterns from incremental databases. A list structure is proposed to store the utility information of patterns, so candidate patterns are not generated and no additional database scan is required. Performance evaluation on various real and synthetic datasets shows that the proposed algorithm is more efficient and effective than the latest approaches in a dynamic environment. (c) 2023 Elsevier B.V. All rights reserved.
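As a rough illustration of the two ideas the abstract relies on, the sketch below computes the utility of an itemset (quantity times unit profit, summed over the transactions that contain it) and sorts itemsets into large, pre-large, and small using an upper and a lower threshold. It is not the algorithm proposed in the paper: the toy database, the profit table, the function names, the threshold values, and the choice to compare raw utility ratios against the thresholds are all assumptions made for this example.

```python
from typing import Dict, Iterable, List

# Hypothetical unit profits (the "importance" of each item).
PROFIT: Dict[str, int] = {"a": 5, "b": 2, "c": 1}

# Hypothetical database: each transaction maps an item to its purchased quantity.
DB: List[Dict[str, int]] = [
    {"a": 1, "b": 2},
    {"b": 3, "c": 4},
    {"a": 2, "b": 1, "c": 1},
]


def transaction_utility(tx: Dict[str, int]) -> int:
    """Sum of quantity * profit over all items in one transaction."""
    return sum(qty * PROFIT[item] for item, qty in tx.items())


def itemset_utility(itemset: Iterable[str], db: List[Dict[str, int]]) -> int:
    """Utility of an itemset, summed over transactions that contain every item."""
    items = list(itemset)
    total = 0
    for tx in db:
        if all(item in tx for item in items):
            total += sum(tx[item] * PROFIT[item] for item in items)
    return total


def classify(itemset: Iterable[str], db: List[Dict[str, int]],
             upper: float, lower: float) -> str:
    """Label an itemset by its share of total database utility.

    Assumption for this sketch: 'large' means the share reaches the upper
    threshold, 'pre-large' means it lies between lower and upper, and
    'small' means it falls below the lower threshold.
    """
    total = sum(transaction_utility(tx) for tx in db)
    ratio = itemset_utility(itemset, db) / total
    if ratio >= upper:
        return "large"
    if ratio >= lower:
        return "pre-large"
    return "small"


if __name__ == "__main__":
    for candidate in (["a", "b"], ["a"], ["c"]):
        print(candidate, classify(candidate, DB, upper=0.5, lower=0.3))
```

The point of keeping a pre-large band is that an itemset close to the upper threshold is remembered between runs, so a small batch of inserted transactions can promote it without rescanning the original data.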
Pages: 24
Related papers
50 in total
  • [41] Mining High Transaction-Weighted Utility Itemsets
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Tseng, Vincent S.
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 1, 2010, : 314 - 318
  • [42] IDHUP: Incremental Discovery of High Utility Pattern
    Yu, Lele
    Gan, Wensheng
    Chen, Zhixiong
    Liu, Yining
    JOURNAL OF INTERNET TECHNOLOGY, 2023, 24 (01): : 135 - 147
  • [43] GPU-Based Efficient Parallel Heuristic Algorithm for High-Utility Itemset Mining in Large Transaction Datasets
    Fang, Wei
    Jiang, Haipeng
    Lu, Hengyang
    Sun, Jun
    Wu, Xiaojun
    Lin, Jerry Chun-Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 652 - 667
  • [44] Maintenance of a Frequent-Itemset Lattice Based on Pre-large Concept
    Bay Vo
    Tuong Le
    Tzung-Pei Hong
    Bac Le
    KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2013), VOL 2, 2014, 245 : 295 - 305
  • [45] Mining of High Utility Itemsets with Negative Utility values for Incremental Datasets
    Pushp
    Chand, Satish
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 431 - 436
  • [46] Improvements of IncSpan: Incremental mining of sequential patterns in large database
    Nguyen, SN
    Sun, XZ
    Orlowska, ME
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 442 - 451
  • [47] A survey of incremental high-utility itemset mining
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    Hong, Tzung-Pei
    Fujita, Hamido
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
  • [48] Incrementally updating the discovered sequential patterns based on pre-large concept
    Lin, Jerry Chun-Wei
    Hong, Tzung-Pei
    Gan, Wensheng
    Chen, Hsin-Yi
    Li, Sheng-Tun
    INTELLIGENT DATA ANALYSIS, 2015, 19 (05) : 1071 - 1089
  • [49] A SAT-Based Approach for Mining High Utility Itemsets from Transaction Databases
    Hidouri, Amel
    Jabbour, Said
    Raddaoui, Badran
    Ben Yaghlane, Boutheina
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2020), 2020, 12393 : 91 - 106
  • [50] High Utility Rare Itemset Mining over Transaction Databases
    Goyal, Vikram
    Dawar, Siddharth
    Sureka, Ashish
    DATABASES IN NETWORKED INFORMATION SYSTEMS (DNIS 2015), 2015, 8999 : 27 - 40