Direct discriminative pattern mining for effective classification

被引:0
|
作者
Cheng, Hong [1 ]
Yan, Xifeng [2 ]
Han, Jiawei [1 ]
Yu, Philip S. [3 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] IBM Corp, T J Watson Res Ctr, Hawthorne, NY 10504 USA
[3] Univ Illinois, Chicago, IL 60680 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The application of frequent patterns in classification has demonstrated its power in recent studies. It often adopts a two-step approach: frequent pattern (or classification rule) mining followed by feature selection (or rule ranking). However, this two-step process could be computationally expensive, especially when the problem scale is large or the minimum support is low. It was observed that frequent pattern mining usually produces a huge number of "patterns" that could not only slow down the mining process but also make feature selection hard to complete. In this paper, we propose a direct discriminative pattern mining approach, DDPMine, to tackle the efficiency issue arising from the two-step approach. DDPMine performs a branch-and-bound search for directly mining discriminative patterns without generating the complete pattern set. Instead of selecting best patterns in a batch, we introduce a "feature-centered" mining approach that generates discriminative patterns sequentially on a progressively shrinking FP-tree by incrementally eliminating training instances. The instance elimination effectively reduces the problem size iteratively and expedites the mining process. Empirical results show that DDPMine achieves orders of magnitude speedup without any downgrade of classification accuracy. It outperforms the state-of-the-art associative classification methods in terms of both accuracy and efficiency.
引用
收藏
页码:169 / +
页数:2
相关论文
共 50 条
  • [41] Protein classification using sequential pattern mining
    Exarchos, Themis P.
    Papaloukas, Costas
    Lampros, Christos
    Fotiadis, Dimitrios I.
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 1436 - +
  • [42] Pattern Mining of Multichannel sEMG for Tremor Classification
    Palmes, Paulito
    Ang, Wei Tech
    Widjaja, Ferdinan
    Tan, Louis C. S.
    Au, Wing Lok
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2010, 57 (12) : 2795 - 2805
  • [43] Discretized sequential pattern mining for behaviour classification
    Buffett, Scott
    GRANULAR COMPUTING, 2021, 6 (04) : 853 - 866
  • [44] HEp-2 cell pattern classification with discriminative dictionary learning
    Kong, Xiangfei
    Li, Kuan
    Cao, Jingjing
    Yang, Qingxiong
    Liu Wenyin
    PATTERN RECOGNITION, 2014, 47 (07) : 2379 - 2388
  • [45] An Effective and Discriminative Feature Learning for URL based Web Page Classification
    Rajalakshmi, R.
    Aravindan, Chandrabose
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 1374 - 1379
  • [46] LCMine: An efficient algorithm for mining discriminative regularities and its application in supervised classification
    Garcia-Borroto, Milton
    Fco Martinez-Trinidad, Jose
    Ariel Carrasco-Ochoa, Jesus
    Angel Medina-Perez, Miguel
    Ruiz-Shulcloper, Jose
    PATTERN RECOGNITION, 2010, 43 (09) : 3025 - 3034
  • [47] Effective periodic pattern mining in time series databases
    Nishi, Manziba Akanda
    Ahmed, Chowdhury Farhan
    Samiullah, Md.
    Jeong, Byeong-Soo
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (08) : 3015 - 3027
  • [48] A Parallel Direct-Vertical Map Reduce Programming model for an effective frequent pattern mining in a dispersed environment
    Yamuna Devi, N.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (24):
  • [49] Adaptive Template Reconstruction for Effective Pattern Classification
    Yang, Su
    Hoque, Sanaul
    Deravi, Farzin
    SENSORS, 2023, 23 (15)
  • [50] An effective feature set for ECG pattern classification
    Ghongade, Rajesh
    Ghatol, Ashok
    MEDICAL BIOMETRICS, PROCEEDINGS, 2007, 4901 : 25 - +