Direct discriminative pattern mining for effective classification

被引:0
|
作者
Cheng, Hong [1 ]
Yan, Xifeng [2 ]
Han, Jiawei [1 ]
Yu, Philip S. [3 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] IBM Corp, T J Watson Res Ctr, Hawthorne, NY 10504 USA
[3] Univ Illinois, Chicago, IL 60680 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The application of frequent patterns in classification has demonstrated its power in recent studies. It often adopts a two-step approach: frequent pattern (or classification rule) mining followed by feature selection (or rule ranking). However, this two-step process could be computationally expensive, especially when the problem scale is large or the minimum support is low. It was observed that frequent pattern mining usually produces a huge number of "patterns" that could not only slow down the mining process but also make feature selection hard to complete. In this paper, we propose a direct discriminative pattern mining approach, DDPMine, to tackle the efficiency issue arising from the two-step approach. DDPMine performs a branch-and-bound search for directly mining discriminative patterns without generating the complete pattern set. Instead of selecting best patterns in a batch, we introduce a "feature-centered" mining approach that generates discriminative patterns sequentially on a progressively shrinking FP-tree by incrementally eliminating training instances. The instance elimination effectively reduces the problem size iteratively and expedites the mining process. Empirical results show that DDPMine achieves orders of magnitude speedup without any downgrade of classification accuracy. It outperforms the state-of-the-art associative classification methods in terms of both accuracy and efficiency.
引用
收藏
页码:169 / +
页数:2
相关论文
共 50 条
  • [1] Discriminative frequent pattern analysis for effective classification
    Cheng, Hong
    Yan, Xifeng
    Han, Jiawei
    Hsu, Chih-Wei
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 691 - +
  • [2] Classification of Software Behaviors for Failure Detection: A Discriminative Pattern Mining Approach
    Lo, David
    Cheng, Hong
    Han, Jiawei
    Khoo, Siau-Cheng
    Sun, Chengnian
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 557 - 565
  • [3] NDPMine: Efficiently Mining Discriminative Numerical Features for Pattern-Based Classification
    Kim, Hyungsul
    Kim, Sangkyum
    Weninger, Tim
    Han, Jiawei
    Abdelzaher, Tarek
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6322 : 35 - 50
  • [4] Discriminative Subsequence Mining for action classification
    Nowozin, Sebastian
    Bakir, Goekhan
    Tsuda, Koji
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1727 - 1734
  • [5] A Genetic Algorithm for Discriminative Graph Pattern Mining
    Vaculik, Karel
    Popelinsky, Lubos
    IDEAS '19: PROCEEDINGS OF THE 23RD INTERNATIONAL DATABASE APPLICATIONS & ENGINEERING SYMPOSIUM (IDEAS 2019), 2019, : 339 - 340
  • [6] Discriminative pattern mining and its applications in bioinformatics
    Liu, Xiaoqing
    Wu, Jun
    Gu, Feiyang
    Wang, Jie
    He, Zengyou
    BRIEFINGS IN BIOINFORMATICS, 2015, 16 (05) : 884 - 900
  • [7] Conditional discriminative pattern mining: Concepts and algorithms
    He, Zengyou
    Gu, Feiyang
    Zhao, Can
    Liu, Xiaoqing
    Wu, Jun
    Wang, Ju
    INFORMATION SCIENCES, 2017, 375 : 1 - 15
  • [8] Constrained Logistic Regression for Discriminative Pattern Mining
    Anand, Rajul
    Reddy, Chandan K.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2011, 6911 : 92 - 107
  • [9] Discriminative Pattern Mining for Breast Cancer Histopathology Image Classification via Fully Convolutional Autoencoder
    Li, Xingyu
    Radulovic, Marko
    Kanjer, Ksenija
    Plataniotis, Konstantinos N.
    IEEE ACCESS, 2019, 7 : 36433 - 36445
  • [10] Discriminative Pattern Mining for Natural Language Metaphor Generation
    Brooks, Jennifer
    Youssef, Abdou
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4276 - 4283