SVM based Chinese web page automatic classification

Cited by: 4
Author
Liang, JZ [1]
Affiliation
[1] Zhejiang Normal Univ, Inst Comp Sci, Jinhua 321004, Peoples R China
Keywords
support vector machine; statistic learning; web page; text classification; pattern recognition;
DOI
10.1109/ICMLC.2003.1259884
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses Chinese web page classification based on the support vector machine (SVM). First, several methods are proposed for feature extraction and selection based on textual keywords. Then, specific issues in statistical learning theory and support vector machines, and their application to classification, are discussed. A quadratic programming algorithm for constructing the SVM classifier is also described. In the experiments, a sample set of 5096 samples is drawn from the web version of the Chinese People's Daily and split into a training set of 3398 samples and a test set of 1698 samples. Two kernel functions, polynomial and radial basis function, are considered in constructing the SVM classifier. The final classification accuracies are 89.81% and 86.51% for the two classifiers, respectively.
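As a rough illustration of the two kernel families named in the abstract, the following sketch evaluates a polynomial kernel and a radial basis function (RBF) kernel on two toy keyword-frequency vectors, and checks the reported train/test split. The parameter values (`degree`, `coef0`, `gamma`) and the vectors `page_a` and `page_b` are assumptions chosen for illustration; the abstract does not state the settings or features the paper actually used.

```python
import math


def polynomial_kernel(x, y, degree=2, coef0=1.0):
    """Polynomial kernel K(x, y) = (x . y + coef0) ** degree."""
    dot = sum(a * b for a, b in zip(x, y))
    return (dot + coef0) ** degree


def rbf_kernel(x, y, gamma=0.5):
    """RBF kernel K(x, y) = exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


# Hypothetical keyword-frequency vectors for two pages (not data from the paper).
page_a = [1.0, 0.0, 2.0]
page_b = [0.0, 1.0, 2.0]

k_poly = polynomial_kernel(page_a, page_b)
k_rbf = rbf_kernel(page_a, page_b)

# Split sizes as reported in the abstract.
n_train, n_test = 3398, 1698
n_total = n_train + n_test
train_fraction = n_train / n_total

print(f"polynomial kernel: {k_poly}")
print(f"RBF kernel: {k_rbf:.4f}")
print(f"total samples: {n_total}, training fraction: {train_fraction:.4f}")
```

The split sizes sum to the 5096 samples stated in the abstract, with roughly two thirds used for training.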
Pages: 2265-2268
Page count: 4
Related papers
50 in total
  • [41] Web page classification based on a simplified swarm optimization
    Lee, Ji-Hyun
    Yeh, Wei-Chang
    Chuang, Mei-Chi
    APPLIED MATHEMATICS AND COMPUTATION, 2015, 270 : 13 - 24
  • [42] A Tool for Link-Based Web Page Classification
    Hernandez, Inma
    Rivero, Carlos R.
    Ruiz, David
    Corchuelo, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7023 : 443 - 452
  • [43] Artificial Immune System Based Web Page Classification
    Onan, Aytug
    SOFTWARE ENGINEERING IN INTELLIGENT SYSTEMS (CSOC2015), VOL 3, 2015, 349 : 189 - 199
  • [44] Web Page Classification Method Based on Semantics and Structure
    Li, Huaxin
    Zhang, Zhaoxin
    Xu, Yongdong
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019: 238 - 243
  • [45] Web Page Classification Algorithm Based on Deep Learning
    Yu, Yuanhui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [46] Web Page Element Classification Based on Visual Features
    Burget, Radek
    Rudolfova, Ivana
    2009 FIRST ASIAN CONFERENCE ON INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2009: 67 - 72
  • [47] Web Page Classification Based on Graph Neural Network
    Guo, Tao
    Cui, Baojiang
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS 2021, 2022, 279 : 188 - 198
  • [48] Dictionary-based Bilingual Web Page Classification
    Liu, Jicheng
    Liang, Chunyan
    Qi, Jianxun
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008: 11542 - 11545
  • [49] A web page classification algorithm based on feature selection
    Zhou, Hongfang
    Guo, Jie
    Wang, Xinyi
    Duan, Wencong
    Wang, Peng
    Cao, Wenquan
    Journal of Information and Computational Science, 2015, 12 (04): 1549 - 1556
  • [50] A Clique Based Web Page Classification Corrective Approach
    Belmouhcine, Abdelbadie
    Benkhalifa, Mohammed
    2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2014: 467 - 473