A Bayesian Classification Approach Using Class-Specific Features for Text Categorization

Cited by: 86
Authors
Tang, Bo [1 ]
He, Haibo [1 ]
Baggenstoss, Paul M. [2 ]
Kay, Steven [1 ]
Affiliations
[1] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[2] Fraunhofer FKIE, Fraunhoferstr 20, D-53343 Wachtberg, Germany
Funding
National Science Foundation (USA);
Keywords
Feature selection; text categorization; class-specific features; PDF projection and estimation; naive Bayes; dimension reduction; SELECTION;
DOI
10.1109/TKDE.2016.2522427
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present a Bayesian classification approach for automatic text categorization using class-specific features. Unlike conventional text categorization approaches, our proposed method selects a specific feature subset for each class. To apply these class-specific features for classification, we follow Baggenstoss's PDF Projection Theorem (PPT) to reconstruct the PDFs in raw data space from the class-specific PDFs in low-dimensional feature subspace, and build a Bayesian classification rule. One noticeable significance of our approach is that most feature selection criteria, such as Information Gain (IG) and Maximum Discrimination (MD), can be easily incorporated into our approach. We evaluate our method's classification performance on several real-world benchmarks, compared with the state-of-the-art feature selection approaches. The superior results demonstrate the effectiveness of the proposed approach and further indicate its wide potential applications in data mining.
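The decision rule described in the abstract can be illustrated with a small sketch. The PDF Projection Theorem reconstructs the raw-space density for class i from its feature z_i as p(x|H_i) = p(x|H_0) p(z_i|H_i) / p(z_i|H_0), so the factor p(x|H_0) is shared by all classes and the rule reduces to maximizing log P(H_i) + log p(z_i|H_i) - log p(z_i|H_0). The code below is not the authors' implementation. It assumes a multinomial word model with add-one smoothing, a reference hypothesis H_0 estimated from the pooled training data, unselected words lumped into a single "other" bin, and a stand-in ranking criterion (the per-word contribution to the KL divergence from H_0) in place of IG or MD. The names `train` and `classify` are hypothetical.

```python
import math
from collections import Counter


def train(docs, labels, k):
    """Fit a toy class-specific-feature classifier (illustrative sketch).

    docs: list of token lists; labels: one class name per document;
    k: number of words kept for each class.
    """
    vocab = sorted({w for d in docs for w in d})
    classes = sorted(set(labels))

    # Word counts per class, and pooled counts for the reference hypothesis H0.
    counts = {c: Counter() for c in classes}
    for d, y in zip(docs, labels):
        counts[y].update(d)
    pooled = Counter()
    for c in classes:
        pooled.update(counts[c])

    def smoothed(cnt):
        # Add-one (Laplace) smoothing over the vocabulary.
        total = sum(cnt.values()) + len(vocab)
        return {w: (cnt[w] + 1) / total for w in vocab}

    p0 = smoothed(pooled)
    params = {}
    for c in classes:
        pc = smoothed(counts[c])
        # Stand-in selection criterion (an assumption, not IG or MD):
        # rank words by their contribution to KL(p(.|c) || p(.|H0)).
        ranked = sorted(vocab, key=lambda w: pc[w] * math.log(pc[w] / p0[w]),
                        reverse=True)
        feats = ranked[:k]
        # Log-likelihood ratio for each selected word.
        llr = {w: math.log(pc[w] / p0[w]) for w in feats}
        # Log-likelihood ratio for the lumped "other" bin.
        rest_c = 1.0 - sum(pc[w] for w in feats)
        rest_0 = 1.0 - sum(p0[w] for w in feats)
        log_prior = math.log(labels.count(c) / len(labels))
        params[c] = (log_prior, llr, math.log(rest_c / rest_0))
    return {"vocab": set(vocab), "params": params}


def classify(model, doc):
    """Return the class maximizing log prior + log p(z_i|H_i) - log p(z_i|H0)."""
    # Words never seen in training are dropped.
    tf = Counter(w for w in doc if w in model["vocab"])
    n_total = sum(tf.values())

    def score(c):
        log_prior, llr, llr_rest = model["params"][c]
        n_selected = sum(n for w, n in tf.items() if w in llr)
        selected_term = sum(n * llr[w] for w, n in tf.items() if w in llr)
        # Multinomial coefficients cancel in the ratio, so only these terms remain.
        return log_prior + selected_term + (n_total - n_selected) * llr_rest

    return max(model["params"], key=score)
```

Because each class is scored through a likelihood ratio against the same reference H_0, the scores stay comparable even though every class uses a different word subset; this is the role the projection theorem plays in the abstract's description.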
Pages: 1602 - 1606
Page count: 5
Related papers
50 in total
  • [41] Time series classification by class-specific Mahalanobis distance measures
    Zoltán Prekopcsák
    Daniel Lemire
    Advances in Data Analysis and Classification, 2012, 6 : 185 - 200
  • [42] Class-specific correction and classification of NIR spectra of edible oils
    Alagappan, Lakshmi
    Chu, Jia En
    Chua, Joanna Huixin
    Ding, Jia Wen
    Xiao, Ronghui
    Yu, Zhe
    Pan, Kun
    Elejalde, Untzizu
    Lim, Kevin Junliang
    Wong, Limsoon
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2023, 241
  • [43] CLASS-SPECIFIC MODEL MIXTURES FOR THE CLASSIFICATION OF TIME-SERIES
    Baggenstoss, Paul M.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2341 - 2345
  • [44] Class-Specific Guided Local Feature Selection for Data Classification
    Qian, Youcheng
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 645 - 649
  • [45] Class-Specific Sparse Principal Component Analysis for Visual Classification
    Pan, Fei
    Zhang, Zai-Xu
    Liu, Bao-Di
    Xie, Ji-Jun
    IEEE ACCESS, 2020, 8 : 110033 - 110047
  • [46] Minimum class variance class-specific extreme learning machine for imbalanced classification
    Raghuwanshi, Bhagat Singh
    Shukla, Sanyam
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 178
  • [47] Time series classification by class-specific Mahalanobis distance measures
    Prekopcsak, Zoltan
    Lemire, Daniel
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2012, 6 (03) : 185 - 200
  • [48] Using Class Based Document Frequency to Select Features in Text Classification
    Li, Baoli
    Yan, Qiuling
    Han, Liping
    BIG DATA TECHNOLOGY AND APPLICATIONS, 2016, 590 : 200 - 210
  • [49] Class-Specific Mahalanobis Distance Metric Learning for Biological Image Classification
    Mohan, B. S. Shajee
    Sekhar, C. Chandra
    IMAGE ANALYSIS AND RECOGNITION, PT II, 2012, 7325 : 240 - 248
  • [50] Class-specific Artificial Immune Recognition method for Hyperspectral Image classification
    Meng, Qingjie
    Zhang, Yanning
    Weiwei
    Ren, Yuemei
    She, Hongwei
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 851 - +