Approximate weighted frequent pattern mining with/without noisy environments

被引:33
|
作者
Yun, Unil [1 ]
Ryu, Keun Ho [1 ]
机构
[1] Chungbuk Natl Univ, Dept Comp Sci, Sch Elect & Comp Engn, Cheongju, South Korea
基金
新加坡国家研究基金会;
关键词
Data mining; Knowledge discovery; Weighted frequent pattern mining; Weighted support; Approximation; CONSTRAINTS; DISCOVERY;
D O I
10.1016/j.knosys.2010.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In data mining area, weighted frequent pattern mining has been suggested to find important frequent patterns by considering the weights of patterns. More extensions with weight constraints have been proposed such as mining weighted association rules, weighted sequential patterns, weighted closed patterns, frequent patterns with dynamic weights, weighted graphs, and weighted sub-trees or sub structures. In previous approaches of weighted frequent pattern mining, weighted supports of patterns were exactly matched to prune weighted infrequent patterns. However, in the noisy environment, the small change in weights or supports of items affects the result sets seriously. This may make the weighted frequent patterns less useful ill the noisy environment. In this paper, we propose the robust concept of mining approximate weighted frequent patterns. Based on the framework of weight based pattern mining, an approximate factor is defined to relax the requirement for exact equality between weighted supports of patterns and a minimum threshold. After that, we address the concept of mining approximate weighted frequent patterns to find important patterns with/without the noisy data. We analyze characteristics of approximate weighted frequent patterns and run extensive performance tests. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:73 / 82
页数:10
相关论文
共 50 条
  • [31] Time-weighted counting for recently frequent pattern mining in data streams
    Kang, Yongsub U.
    Kang, U.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 53 (02) : 391 - 422
  • [32] An approximate approach to frequent itemset mining
    Zhang, Chunkai
    Zhang, Xudong
    Tian, Panbo
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 68 - 73
  • [33] Approximate mining of frequent patterns on streams
    Silvestri, Claudio
    Orlando, Salvatore
    INTELLIGENT DATA ANALYSIS, 2007, 11 (01) : 49 - 73
  • [34] Mining Frequent Weighted Itemsets without Storing Transaction IDs and Generating Candidates
    Lee, Gangin
    Yun, Unil
    Ryu, Keun Ho
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2017, 25 (01) : 111 - 144
  • [35] Efficient algorithms for frequent pattern mining in many-task computing environments
    Lin, Kawuu W.
    Lo, Yu-Chin
    KNOWLEDGE-BASED SYSTEMS, 2013, 49 : 10 - 21
  • [36] A comparison between approximate counting and sampling methods for frequent pattern mining on data streams
    Ng, Willie
    Dash, Manoranjan
    INTELLIGENT DATA ANALYSIS, 2010, 14 (06) : 749 - 771
  • [37] Reframing in Frequent Pattern Mining
    Ahmed, Chowdhury Farhan
    Samiullah, Md.
    Lachiche, Nicolas
    Kull, Meelis
    Flach, Peter
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 799 - 806
  • [38] Weighted Frequent Subgraph Mining in Weighted Graph Databases
    Shinoda, Masaki
    Ozaki, Tomonobu
    Ohkawa, Takenao
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 58 - +
  • [39] Mining Frequent Weighted Closed Itemsets
    Bay Vo
    Nhu-Y Tran
    Duong-Ha Ngo
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 379 - 390
  • [40] Analysis of tree-based uncertain frequent pattern mining techniques without pattern losses
    Lee, Gangin
    Yun, Unil
    Lee, Kyung-Min
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (11): : 4296 - 4318