A wrapper approach with support vector machines for text categorization

Cited by: 0
Authors
Montanés, E [1]
Quevedo, JR [1]
Díaz, I [1]
Affiliation
[1] Univ Oviedo, Ctr Artificial Intelligence, Gijon, Asturias, Spain
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Text Categorization (TC), the assignment of predefined categories to the documents of a corpus, plays an important role in a wide variety of information organization and management tasks in Information Retrieval (IR). It involves handling a large amount of information, some of which may be noisy or irrelevant; hence, prior feature reduction can improve classification performance. In this paper we propose a wrapper approach. Wrapper approaches are time-consuming and sometimes infeasible. Our wrapper, however, explores a reduced number of feature subsets and uses Support Vector Machines (SVM) as the evaluation system; these two properties make it fast enough to deal with the large number of features present in text domains. Using the Reuters-21578 corpus, we also compare this wrapper with the feature reduction approach most widely applied in TC, which consists of filtering according to scoring measures.
Pages: 230-237
Number of pages: 8
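
The idea summarized in the abstract (evaluate only a small number of candidate feature subsets, and use an SVM as the evaluator) can be illustrated with the minimal sketch below. It is not the authors' algorithm: the abstract does not say how the subsets are chosen or how the SVM is trained. The sketch assumes that candidates are the top-k features of a simple filter ranking, that the evaluator is a small linear SVM trained by a Pegasos-style subgradient method, and that quality is measured by leave-one-out accuracy. The toy data and all function names (`rank_features`, `train_linear_svm`, `loo_accuracy`, `wrapper_select`) are hypothetical.

```python
# Illustrative sketch only: nested top-k subsets scored by a tiny linear SVM.
# The ranking measure, the SVM trainer and the toy data are assumptions.

def rank_features(X, y):
    """Rank features by |mean(positive docs) - mean(negative docs)| (a stand-in filter score)."""
    pos = [row for row, label in zip(X, y) if label == 1]
    neg = [row for row, label in zip(X, y) if label == -1]
    scores = []
    for j in range(len(X[0])):
        mean_pos = sum(row[j] for row in pos) / len(pos)
        mean_neg = sum(row[j] for row in neg) / len(neg)
        scores.append(abs(mean_pos - mean_neg))
    # Highest score first; ties broken by feature index.
    return sorted(range(len(scores)), key=lambda j: (-scores[j], j))

def train_linear_svm(X, y, lam=0.01, epochs=50):
    """Pegasos-style subgradient descent on the L2-regularized hinge loss.
    The bias is handled as an extra constant feature (a simplification)."""
    w = [0.0] * (len(X[0]) + 1)
    t = 0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            t += 1
            eta = 1.0 / (lam * t)
            xb = list(xi) + [1.0]
            margin = yi * sum(wj * xj for wj, xj in zip(w, xb))
            w = [(1.0 - eta * lam) * wj for wj in w]  # shrink (regularization)
            if margin < 1:  # hinge loss is active for this example
                w = [wj + eta * yi * xj for wj, xj in zip(w, xb)]
    return w

def predict(w, x):
    score = sum(wj * xj for wj, xj in zip(w, list(x) + [1.0]))
    return 1 if score >= 0 else -1

def loo_accuracy(X, y):
    """Leave-one-out accuracy of the linear SVM on the given feature matrix."""
    correct = 0
    for i in range(len(X)):
        w = train_linear_svm(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        correct += predict(w, X[i]) == y[i]
    return correct / len(X)

def wrapper_select(X, y, sizes):
    """Evaluate only the top-k subsets for k in `sizes`; keep the best one.
    On ties the earlier (smaller, if sizes is ascending) subset wins."""
    ranking = rank_features(X, y)
    best_subset, best_acc = None, -1.0
    for k in sizes:
        subset = ranking[:k]
        X_k = [[row[j] for j in subset] for row in X]
        acc = loo_accuracy(X_k, y)
        if acc > best_acc:
            best_subset, best_acc = subset, acc
    return best_subset, best_acc

# Toy term-frequency matrix: features 0 and 1 are informative, 2 and 3 are noise.
X = [[3, 0, 1, 2], [2, 0, 2, 1], [4, 1, 1, 1], [3, 0, 2, 2],
     [0, 3, 1, 2], [1, 2, 2, 1], [0, 4, 1, 1], [0, 3, 2, 2]]
y = [1, 1, 1, 1, -1, -1, -1, -1]

subset, acc = wrapper_select(X, y, sizes=[1, 2, 4])
print("selected features:", subset, "leave-one-out accuracy:", acc)
```

Restricting the search to a few nested subsets keeps the number of SVM trainings linear in the number of candidate sizes, rather than exponential in the number of features, which is the general reason a wrapper of this kind can remain tractable on text data.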