A wrapper approach with support vector machines for text categorization

Cited by: 0
Authors
Montanés, E [1]
Quevedo, JR [1]
Díaz, I [1]
Affiliations
[1] Univ Oviedo, Ctr Artificial Intelligence, Gijon, Asturias, Spain
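Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract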
Text Categorization (TC), the assignment of predefined categories to the documents of a corpus, plays an important role in a wide variety of information organization and management tasks in Information Retrieval (IR). It involves handling a large amount of information, some of which may be noisy or irrelevant, so a prior feature reduction step can improve classification performance. In this paper we propose a wrapper approach. Wrapper approaches are time-consuming and sometimes infeasible, but our wrapper explores a reduced number of feature subsets and uses Support Vector Machines (SVM) as the evaluation system; these two properties make it fast enough to deal with the large number of features present in text domains. Using the Reuters-21578 corpus, we also compare this wrapper with the feature reduction approach most widely applied in TC, which consists of filtering according to scoring measures.
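The contrast the abstract draws between filtering by a scoring measure and a wrapper that evaluates only a few feature subsets with an SVM can be sketched roughly as follows. This is a minimal illustration and not the authors' method: the abstract does not say how the candidate subsets are generated, so the sketch assumes they are the top-k terms of a filter ranking for a handful of values of k. The toy corpus, the document-frequency-difference score, the Pegasos-style linear SVM trainer, and every function name are hypothetical stand-ins chosen so the example runs with the standard library only.

```python
import random

def train_linear_svm(X, y, lam=0.01, epochs=30, seed=0):
    """Pegasos-style subgradient descent on the hinge loss; labels are +1/-1.
    Assumption: a stand-in for a real SVM library, used here as the evaluator."""
    rng = random.Random(seed)
    w = [0.0] * (len(X[0]) + 1)            # last weight acts as a bias term
    order, t = list(range(len(X))), 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)          # decreasing step size
            x = X[i] + [1.0]
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, x))
            w = [(1.0 - eta * lam) * wj for wj in w]
            if margin < 1.0:               # hinge loss is active for this example
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, x)]
    return w

def accuracy(w, X, y):
    hits = 0
    for xi, yi in zip(X, y):
        score = sum(wj * xj for wj, xj in zip(w, xi + [1.0]))
        hits += (1 if score >= 0 else -1) == yi
    return hits / len(X)

def filter_scores(X, y):
    """Hypothetical filter score: |doc. frequency in positives - in negatives|."""
    n_pos = sum(1 for yi in y if yi == 1)
    n_neg = len(y) - n_pos
    scores = []
    for j in range(len(X[0])):
        pos = sum(xi[j] for xi, yi in zip(X, y) if yi == 1) / n_pos
        neg = sum(xi[j] for xi, yi in zip(X, y) if yi == -1) / n_neg
        scores.append(abs(pos - neg))
    return scores

def project(X, cols):
    return [[xi[j] for j in cols] for xi in X]

def wrapper_select(X_tr, y_tr, X_va, y_va, sizes):
    """Evaluate only len(sizes) candidate subsets (top-k of the filter ranking)
    and keep the one whose SVM scores best on held-out documents."""
    scores = filter_scores(X_tr, y_tr)
    ranking = sorted(range(len(scores)), key=lambda j: -scores[j])
    best_k, best_acc, best_cols = None, -1.0, []
    for k in sizes:
        cols = ranking[:k]
        w = train_linear_svm(project(X_tr, cols), y_tr)
        acc = accuracy(w, project(X_va, cols), y_va)
        if acc > best_acc:
            best_k, best_acc, best_cols = k, acc, cols
    return best_k, best_acc, best_cols

def make_toy_corpus(n_docs=200, n_terms=40, n_informative=4, seed=1):
    """Synthetic binary term-presence data; not Reuters-21578."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_docs):
        label = rng.choice([1, -1])
        row = []
        for j in range(n_terms):
            p = (0.8 if label == 1 else 0.2) if j < n_informative else 0.3
            row.append(1.0 if rng.random() < p else 0.0)
        X.append(row)
        y.append(label)
    return X, y

X, y = make_toy_corpus()
k, acc, cols = wrapper_select(X[:150], y[:150], X[150:], y[150:], [2, 4, 8, 16, 40])
print("selected subset size:", k, "held-out accuracy:", round(acc, 3))
```

A pure filter would stop after `filter_scores` and cut the ranking at a fixed threshold, whereas the wrapper step lets the classifier itself choose among the candidate cut-offs. Restricting the search to five nested subsets is what keeps the number of SVM trainings small in this sketch.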
Pages: 230-237
Page count: 8
Related papers
50 in total
  • [41] SVM categorizer: A generic categorization tool using support vector machines
    Kapoutsis, E
    Theodoulidis, B
    Saraee, M
    IC-AI '04 & MLMTA'04 , VOL 1 AND 2, PROCEEDINGS, 2004, : 1109 - 1112
  • [42] Fuzzy support vector machine for multi-class text categorization
    Wang, Tai-Yue
    Chiang, Huei-Min
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (04) : 914 - 929
  • [43] A method of Chinese text categorization based on proximal support vector machine
    Zhou, JG
    Wang, K
    Wu, J
    Yan, PL
    Wu, M
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 1615 - 1619
  • [44] Evolutionary wrapper approaches for training set selection as preprocessing mechanism for support vector machines: Experimental evaluation and support vector analysis
    Verbiest, Nele
    Derrac, Joaquin
    Cornelis, Chris
    Garcia, Salvador
    Herrera, Francisco
    APPLIED SOFT COMPUTING, 2016, 38 : 10 - 22
  • [45] WORD COMBINATION KERNEL FOR TEXT CLASSIFICATION WITH SUPPORT VECTOR MACHINES
    Zhang, Lujiang
    Hu, Xiaohui
    COMPUTING AND INFORMATICS, 2013, 32 (04) : 877 - 896
  • [46] Text classification: neural networks vs support vector machines
    Zaghloul, Waleed
    Lee, Sang M.
    Trimi, Silvana
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2009, 109 (5-6) : 708 - 717
  • [47] Korean text chunk identification using Support Vector Machines
    Kim, Sang-Soo
    Son, Jeong woo
    Kong, Mi-hwa
    Park, Seong-Bae
    Lee, Sang-Jo
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, PROCEEDINGS, 2006, : 674 - +
  • [48] Offline Handwritten Text Recognition Using Support Vector Machines
    Rajnoha, Martin
    Burget, Radim
    Dutta, Malay Kishore
    2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 132 - 136
  • [49] Representative sampling for text classification using support vector machines
    Xu, Z
    Yu, K
    Tresp, V
    Xu, XW
    Wang, JZ
    ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 393 - 407
  • [50] Research and improvement of text categorisation based on support vector machines
    Xie, J. B.
    Hou, Y. J.
    Xie, G. Y.
    Xie, G. F.
    INFORMATION SCIENCE AND ELECTRONIC ENGINEERING, 2017, : 207 - 212