A Bayesian Classification Approach Using Class-Specific Features for Text Categorization

Cited by: 86
Authors
Tang, Bo [1 ]
He, Haibo [1 ]
Baggenstoss, Paul M. [2 ]
Kay, Steven [1 ]
Affiliations
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Fraunhofer FKIE, Fraunhoferstr 20, D-53343 Wachtberg, Germany
Funding
National Science Foundation (USA);
Keywords
Feature selection; text categorization; class-specific features; PDF projection and estimation; naive Bayes; dimension reduction; SELECTION;
DOI
10.1109/TKDE.2016.2522427
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present a Bayesian classification approach for automatic text categorization using class-specific features. Unlike conventional text categorization approaches, which select one feature subset shared by all classes, our method selects a separate feature subset for each class. To use these class-specific features for classification, we follow Baggenstoss's PDF Projection Theorem (PPT) to reconstruct the PDFs in the raw data space from the class-specific PDFs in the low-dimensional feature subspaces, and build a Bayesian classification rule on the reconstructed PDFs. A notable advantage of our approach is that most feature selection criteria, such as Information Gain (IG) and Maximum Discrimination (MD), can be easily incorporated into it. We evaluate the method's classification performance on several real-world benchmarks and compare it with state-of-the-art feature selection approaches. The superior results demonstrate the effectiveness of the proposed approach and indicate its potential for wide application in data mining.
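The idea in the abstract can be illustrated with a small sketch. The PDF Projection Theorem is usually stated as p(x|H_i) = p(x|H_0) · p(z_i|H_i) / p(z_i|H_0), where z_i is the feature vector selected for class i and H_0 is a common reference hypothesis; because p(x|H_0) is the same for every class, a Bayesian rule can compare only the prior times the feature-space likelihood ratio. The Python code below is a simplified illustration under stated assumptions, not the authors' implementation: the reference H_0 is taken to be the word distribution pooled over all classes, word counts are treated naive-Bayes style, and the selection score is a hypothetical KL-style stand-in for criteria such as IG or MD. The names `train` and `classify` are illustrative only.

```python
import math
from collections import Counter


def train(docs, labels, k, alpha=1.0):
    """Fit a toy class-specific-feature classifier.

    docs   : list of token lists
    labels : list of class labels, one per document
    k      : number of words kept per class (a separate subset for each class)
    alpha  : Laplace smoothing constant
    """
    vocab = sorted({w for d in docs for w in d})
    classes = sorted(set(labels))

    pooled = Counter(w for d in docs for w in d)
    per_class = {c: Counter() for c in classes}
    for d, y in zip(docs, labels):
        per_class[y].update(d)

    def smoothed(counts):
        total = sum(counts.values()) + alpha * len(vocab)
        return {w: (counts[w] + alpha) / total for w in vocab}

    # Assumption: the reference hypothesis H0 is the pooled word distribution.
    p0 = smoothed(pooled)

    model = {}
    for c in classes:
        pc = smoothed(per_class[c])
        # Assumption: hypothetical KL-style score; any criterion could be used here.
        score = {w: pc[w] * math.log(pc[w] / p0[w]) for w in vocab}
        selected = sorted(vocab, key=lambda w: score[w], reverse=True)[:k]
        log_prior = math.log(labels.count(c) / len(labels))
        # Store log p(w|H_c) / p(w|H_0) only for this class's own features.
        model[c] = (log_prior, {w: math.log(pc[w] / p0[w]) for w in selected})
    return model


def classify(model, doc):
    """Pick the class maximizing log prior + log likelihood ratio on its own features."""
    counts = Counter(doc)

    def log_score(c):
        log_prior, log_ratio = model[c]
        return log_prior + sum(counts[w] * r for w, r in log_ratio.items())

    return max(model, key=log_score)


if __name__ == "__main__":
    docs = [
        ["goal", "match", "team"],
        ["team", "goal", "score"],
        ["stock", "market", "price"],
        ["market", "trade", "stock"],
    ]
    labels = ["sports", "sports", "finance", "finance"]
    model = train(docs, labels, k=2)
    print(classify(model, ["goal", "team", "win"]))
```

Each class is scored only on its own selected words, yet the scores stay comparable because every ratio is taken against the same reference distribution. That shared reference is what allows a different feature subset per class.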
Pages: 1602 - 1606
Number of pages: 5
Related papers
50 in total
  • [21] Sufficiency, classification, and the class-specific feature theorem
    Kay, S
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2000, 46 (04) : 1654 - 1658
  • [22] BAYESIAN MULTINOMIAL REGRESSION WITH CLASS-SPECIFIC PREDICTOR SELECTION
    Gustafson, Paul
    Lefebvre, Genevieve
    ANNALS OF APPLIED STATISTICS, 2008, 2 (04) : 1478 - 1502
  • [23] Equivalence classification by California sea lions using class-specific reinforcers
    Kastak, CR
    Schusterman, RJ
    Kastak, D
    JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR, 2001, 76 (02) : 131 - 158
  • [24] Mining for class-specific motifs in protein sequence classification
    Satish M Srinivasan
    Suleyman Vural
    Brian R King
    Chittibabu Guda
    BMC Bioinformatics, 14
  • [25] Software design patterns classification and selection using text categorization approach
    Hussain, Shahid
    Keung, Jacky
    Khan, Arif Ali
    APPLIED SOFT COMPUTING, 2017, 58 : 225 - 244
  • [26] A Multi-Resolution Hidden Markov Model Using Class-Specific Features
    Baggenstoss, Paul M.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (10) : 5165 - 5177
  • [27] CNN Approaches to Classify Multivariate Time Series Using Class-specific Features
    Hao, Yifan
    Cao, Huiping
    Draayer, Erick
    2020 IEEE INTERNATIONAL CONFERENCE ON SMART DATA SERVICES (SMDS 2020), 2020 : 1 - 8
  • [28] LEARNING CLASS-SPECIFIC POOLING SHAPES FOR IMAGE CLASSIFICATION
    Wang, Jinzhuo
    Wang, Wenmin
    Wang, Ronggang
    Gao, Wen
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015
  • [29] Class-Specific Pre-trained Sparse Autoencoders for Learning Effective Features for Document Classification
    Abdulhussain, Maysa I.
    Gan, John Q.
    2016 8TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2016 : 36 - 41
  • [30] Inherently Interpretable Multi-Label Classification Using Class-Specific Counterfactuals
    Sun, Susu
    Woerner, Stefano
    Maier, Andreas
    Koch, Lisa M.
    Baumgartner, Christian F.
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 937 - 956