FDHUP: Fast algorithm for mining discriminative high utility patterns

Cited by: 53
Authors
Lin, Jerry Chun-Wei [1]
Gan, Wensheng [1]
Fournier-Viger, Philippe [2]
Hong, Tzung-Pei [3, 4]
Chao, Han-Chieh [1, 5]
Affiliations
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[2] Harbin Inst Technol Shenzhen, Sch Nat Sci & Humanities, Shenzhen, Peoples R China
[3] Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, Kaohsiung, Taiwan
[4] Natl Sun Yat Sen Univ, Dept Comp Sci & Engn, Kaohsiung, Taiwan
[5] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Hualien, Taiwan
Funding
National Natural Science Foundation of China
Keywords
Utility mining; Frequency affinity; Discriminative high utility patterns; Pruning strategies; EFFICIENT ALGORITHMS; ASSOCIATION RULES; ITEMSETS; DATABASES; PERSPECTIVE;
DOI
10.1007/s10115-016-0991-3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, high utility pattern mining (HUPM) has been extensively studied. Many approaches for HUPM have been proposed in recent years, but most of them aim at mining HUPs without any consideration for their frequency. This has the major drawback that any combination of a low utility item with a very high utility pattern is regarded as a HUP, even if this combination has low affinity and contains items that rarely co-occur. Thus, frequency should be a key criterion to select HUPs. To address this issue, and derive high utility interesting patterns (HUIPs) with strong frequency affinity, the HUIPM algorithm was proposed. However, it recursively constructs a series of conditional trees to produce candidates and then derive the HUIPs. This procedure is time-consuming and may lead to a combinatorial explosion when the minimum utility threshold is set relatively low. In this paper, an efficient algorithm named fast algorithm for mining discriminative high utility patterns (DHUPs) with strong frequency affinity (FDHUP) is proposed to efficiently discover DHUPs by considering both the utility and frequency affinity constraints. Two compact structures named EI-table and FU-tree and three pruning strategies are introduced in the proposed algorithm to reduce the search space, and efficiently and effectively discover DHUPs. An extensive experimental study shows that the proposed FDHUP algorithm considerably outperforms the state-of-the-art HUIPM algorithm in terms of execution time, memory consumption, and scalability.
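The drawback described in the abstract (a low utility item attached to a very high utility pattern still counts as a HUP) can be illustrated with a minimal Python sketch on a toy database. The affinity-weighted measure used here, the smallest quantity among the pattern's items in a transaction multiplied by the sum of their unit profits, is an assumption made for illustration, since the abstract does not define the measure. All names and data are hypothetical, and the brute-force enumeration is not the FDHUP algorithm: it uses no EI-table, FU-tree, or pruning strategy.

```python
from itertools import combinations

# Hypothetical toy data: unit profit per item, and transactions as {item: quantity}.
PROFIT = {"a": 1, "b": 50, "c": 3}
DB = [
    {"a": 1, "b": 4},
    {"b": 5, "c": 2},
    {"a": 2, "c": 6},
    {"b": 3},
]


def utility(pattern, db=DB, profit=PROFIT):
    """Plain utility: sum of quantity * unit profit over transactions containing the pattern."""
    return sum(
        sum(t[i] * profit[i] for i in pattern)
        for t in db
        if all(i in t for i in pattern)
    )


def affinitive_utility(pattern, db=DB, profit=PROFIT):
    """Assumed affinity-weighted utility: (min quantity in the transaction) * (sum of unit profits)."""
    return sum(
        min(t[i] for i in pattern) * sum(profit[i] for i in pattern)
        for t in db
        if all(i in t for i in pattern)
    )


def mine(measure, min_util, db=DB):
    """Brute-force search: keep every non-empty pattern whose measure reaches min_util."""
    items = sorted({i for t in db for i in t})
    result = {}
    for k in range(1, len(items) + 1):
        for pattern in combinations(items, k):
            value = measure(pattern, db)
            if value >= min_util:
                result[pattern] = value
    return result
```

With a threshold of 150, `mine(utility, 150)` keeps `('b',)`, `('a', 'b')` and `('b', 'c')`, because the profitable item `b` carries the cheap items along with it. Under the assumed affinity-weighted measure, `mine(affinitive_utility, 150)` keeps only `('b',)`, since `a` and `c` co-occur with `b` in small quantities.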
Pages: 873-909
Page count: 37
Related papers
50 in total
  • [1] FDHUP: Fast algorithm for mining discriminative high utility patterns
    Jerry Chun-Wei Lin
    Wensheng Gan
    Philippe Fournier-Viger
    Tzung-Pei Hong
    Han-Chieh Chao
    Knowledge and Information Systems, 2017, 51 : 873 - 909
  • [2] Mining Discriminative High Utility Patterns
    Lin, Jerry Chun-Wei
    Gan, Wensheng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT II, 2016, 9622 : 219 - 229
  • [3] A Fast Algorithm for Mining High Utility Itemsets
    Shankar, S.
    Purusothaman, T.
    Jayanthi, S.
    Babu, Nishanth
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1459 - +
  • [4] Mining Significant Utility Discriminative Patterns in Quantitative Databases
    Tang, Huijun
    Wang, Jufeng
    Wang, Le
    MATHEMATICS, 2023, 11 (04)
  • [5] A fast algorithm for hiding high utility sequential patterns
    Zhang, Chunkai
    Zu, Yiwen
    Nie, Junli
    Du, Linzi
    Du, Jingqi
    Hong, Siyuan
    Wu, Wenping
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1316 - 1322
  • [6] A fast algorithm for mining high average-utility itemsets
    Lin, Jerry Chun-Wei
    Ren, Shifeng
    Fournier-Viger, Philippe
    Hong, Tzung-Pei
    Su, Ja-Hwung
    Vo, Bay
    APPLIED INTELLIGENCE, 2017, 47 (02) : 331 - 346
  • [7] A fast algorithm for mining high average-utility itemsets
    Jerry Chun-Wei Lin
    Shifeng Ren
    Philippe Fournier-Viger
    Tzung-Pei Hong
    Ja-Hwung Su
    Bay Vo
    Applied Intelligence, 2017, 47 : 331 - 346
  • [8] An Algorithm for Mining High Utility Sequential Patterns with Time Interval
    Tran Huy Duong
    Janos, Demetrovics
    Vu Duc Thi
    Nguyen Truong Thang
    Tran The Anh
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2019, 19 (04) : 3 - 16
  • [9] Fast algorithm for high utility pattern mining with the sum of item quantities
    Ryang, Heungmo
    Yun, Unil
    Ryu, Keun Ho
    INTELLIGENT DATA ANALYSIS, 2016, 20 (02) : 395 - 415
  • [10] mHUIMiner: A Fast High Utility Itemset Mining Algorithm for Sparse Datasets
    Peng, Alex Yuxuan
    Koh, Yun Sing
    Riddle, Patricia
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT II, 2017, 10235 : 196 - 207