A hybrid generative/discriminative approach to text classification with additional information

被引:13
|
作者
Fujino, Akinori [1 ]
Ueda, Naonori [1 ]
Saito, Kazumi [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
关键词
multiclass and single-labeled text classification; multiple components; maximum entropy principle; Naive Bayes model;
D O I
10.1016/j.ipm.2006.07.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a classifier for text data samples consisting of main text and additional components, such as Web pages and technical papers. We focus on multiclass and single-labeled text classification problems and design the classifier based on a hybrid composed of probabilistic generative and discriminative approaches. Our formulation considers individual component generative models and constructs the classifier by combining these trained models based on the maximum entropy principle. We use naive Bayes models as the component generative models for the main text and additional components such as titles, links, and authors, so that we can apply our formulation to document and Web page classification problems. Our experimental results for four test collections confirmed that our hybrid approach effectively combined main text and additional components and thus improved classification performance. (c) 2006 Published by Elsevier Ltd.
引用
收藏
页码:379 / 392
页数:14
相关论文
共 50 条
  • [1] Scene classification using a hybrid generative/discriminative approach
    Bosch, Anna
    Zisserman, Andrew
    Munoz, Xavier
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) : 712 - 727
  • [2] Classification with hybrid generative/discriminative models
    Raina, R
    Shen, YR
    Ng, AY
    McCallum, A
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 545 - 552
  • [3] A generative-discriminative hybrid for sequential data classification
    Abou-Moustafa, KT
    Suen, CY
    Cheriet, M
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 805 - 808
  • [4] A Hybrid Discriminative/Generative Approach for Modeling Human Activities
    Lester, Jonathan
    Choudhury, Tanzeem
    Kern, Nicky
    Borriello, Gaetano
    Hannaford, Blake
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 766 - 772
  • [5] Using Hybrid Discriminative-Generative Models for Binary Classification
    Abroyan, N.
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2019, 53 (04) : 320 - 327
  • [6] A Hybrid Generative-Discriminative Approach to Speaker Diarization
    Noulas, Athanasios K.
    van Kasteren, Tim
    Kroese, Ben J. A.
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 : 98 - 109
  • [7] Hybrid Generative/Discriminative Approaches for Proportional Data Modeling and Classification
    Bouguila, Nizar
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (12) : 2184 - 2202
  • [8] Hybrid Generative-Discriminative Classification using Posterior Divergence
    Li, Xiong
    Lee, Tai Sing
    Liu, Yuncai
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [9] Using Hybrid Discriminative-Generative Models for Binary Classification
    N. Abroyan
    Automatic Control and Computer Sciences, 2019, 53 : 320 - 327
  • [10] A hybrid discriminative/generative approach to protein fold recognition
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    NEUROCOMPUTING, 2012, 75 (01) : 194 - 198