Research on SVM-Based Automatic Classification of Chinese Web Page

被引:0
|
作者
Song, Jie [1 ]
Liu, Yanque [1 ]
Li, Nana [1 ]
Gu, Junhua [1 ]
机构
[1] Hebei Univ Technol, Coll Comp Sci & Software, Tianjin 300401, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper deals with Chinese web page classification based on support vector machine (SVM). Some methods are proposed for text extraction and Chinese word segment. And it discusses the different contribution to text classification on different locations of the page. The SVM classifier is applied on classification. The results showed that the performance of the classification has further improved, for the text without noisy blocks after extraction, high correct rate of Chinese word segment. In addition, picking the title, keywords and description out, and increasing its weighs can also improve the accuracy of classification.
引用
收藏
页码:160 / 164
页数:5
相关论文
共 50 条
  • [41] Automatic classification of academic web page types
    Kenekayoro, Patrick
    Buckley, Kevan
    Thelwall, Mike
    SCIENTOMETRICS, 2014, 101 (02) : 1015 - 1026
  • [42] Research and Implementation of Real-time Automatic Web Page Classification System
    Han, Weihong
    Zhu, Weihui
    Jia, Yan
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MATERIAL, MECHANICAL AND MANUFACTURING ENGINEERING, 2015, 27 : 977 - 982
  • [43] An Accurate SVM-Based Classification Approach for Hyperspectral Image Classification
    Baassou, Belkacem
    He, Mingyi
    Mei, Shaohui
    2013 21ST INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS), 2013,
  • [44] Research on Classification of Chinese Text Data Based on SVM
    Lin, Yuan
    Yu, Hongzhi
    Wan, Fucheng
    Xu, Tao
    2017 2ND INTERNATIONAL SEMINAR ON ADVANCES IN MATERIALS SCIENCE AND ENGINEERING, 2017, 231
  • [45] SVM-based Automatic Annotation of Multiple Sequence Alignments
    Ren, Jiansi
    JOURNAL OF COMPUTERS, 2014, 9 (05) : 1109 - 1116
  • [46] Chinese web-page classification study
    Huang, Weitong
    Lu-Xiong Xu
    Duan, Junfeng
    Lu, Yuchang
    2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 2141 - +
  • [47] Research on Web Page Classification Method Based on Query Log
    Ye F.
    Ma Y.
    Journal of Shanghai Jiaotong University (Science), 2018, 23 (3) : 404 - 410
  • [48] Research on Web Page Classification Method Based on Query Log
    叶飞跃
    马祎星
    Journal of Shanghai Jiaotong University(Science), 2018, 23 (03) : 404 - 410
  • [49] Research on SVM-based MRI image segmentation
    Liu, Yu-Ting
    Zhang, Hong-Xin
    Li, Pei-Hua
    Journal of China Universities of Posts and Telecommunications, 2011, 18 (SUPPL.2): : 129 - 132
  • [50] Research on web page classification-based core characteristics and web structure
    Zengmin, Geng
    Jianxia, Du
    International Journal of Wireless and Mobile Computing, 2014, 7 (03) : 253 - 257