SVM based Chinese web page automatic classification

被引:4
|
作者
Liang, JZ [1 ]
机构
[1] Zhejiang Normal Univ, Inst Comp Sci, Jinhua 321004, Peoples R China
关键词
support vector machine; statistic learning; web page; text classification; pattern recognition;
D O I
10.1109/ICMLC.2003.1259884
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with Chinese web page classification based on support vector machine (SVM). First, Some methods are proposed for feature extraction and selection based on textual keywords. Then Special problems are discussed on statistic learning theory, support vector machine and their application in classification. Quadratic program algorithm is also described for constructing the SVM classifier. In the experiment part, the sample set, including 5096 samples, is chosen from the web version of Chinese People's Daily. It is separated into two sets, the training set with 3398 samples and the test set with 1698 samples. Two kinds of kernel function, polynomial and radial basis function, are considered in constructing the SVM classifier. The final classification correct rates are 89.81%, 86.51% for the two classifiers, respectively.
引用
收藏
页码:2265 / 2268
页数:4
相关论文
共 50 条
  • [21] Automatic Classification of Galaxies Based on SVM
    Bastanfard, Azam
    Amirkhani, Dariush
    Abbasiasl, Moslem
    2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 32 - 39
  • [22] A Novel Feature Selection Framework for Automatic Web Page Classification
    J.Alamelu Mangai
    V.Santhosh Kumar
    S.Appavu alias Balamurugan
    International Journal of Automation and Computing, 2012, (04) : 442 - 448
  • [23] A Novel Feature Selection Framework for Automatic Web Page Classification
    JAlamelu Mangai
    VSanthosh Kumar
    SAppavu alias Balamurugan
    International Journal of Automation & Computing , 2012, (04) : 442 - 448
  • [24] A Novel Approach to Naive Bayes Web Page Automatic Classification
    He, Zhongli
    Liu, Zhijing
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 361 - 365
  • [25] A Novel Feature Selection Framework for Automatic Web Page Classification
    Mangai, J. Alamelu
    Kumar, V. Santhosh
    Balamurugan, S. Appavu Alias
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2012, 9 (04) : 442 - 448
  • [26] Web document classification based on SVM
    Niu, Qiang
    Wang, Zhixiao
    Chen, Dai
    DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 619 - 622
  • [27] A Novel Voting Algorithm of Multi-Class SVM for Web Page Classification
    Thamrongrat, Pompon
    Preechaveerakul, Ladda
    Wettayaprasit, Wiphada
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 2, 2009, : 327 - +
  • [28] Chinese web page classification based on self-organizing mapping neural networks
    Liang, JZ
    ICCIMA 2003: FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2003, : 96 - 101
  • [29] A Comparison of Approaches to Semi-supervised Multiclass SVM for Web Page Classification
    Zubiaga, Arkaitz
    Fresno, Victor
    Martinez, Raquel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (42): : 63 - 70
  • [30] Multi-class SVM with negative data selection for web page classification
    Chen, CM
    Lee, HM
    Kao, MT
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 2047 - 2052