Research on SVM-Based Automatic Classification of Chinese Web Page

Cited by: 0
Authors
Song, Jie [1]
Liu, Yanque [1]
Li, Nana [1]
Gu, Junhua [1]
Affiliations
[1] Hebei Univ Technol, Coll Comp Sci & Software, Tianjin 300401, Peoples R China
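Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract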
This paper addresses Chinese web page classification based on support vector machines (SVM). Methods are proposed for text extraction and Chinese word segmentation. The paper also discusses how text from different locations on a page contributes differently to classification. An SVM classifier is used for the classification itself. The results show that classification performance improves when noisy blocks are removed during text extraction and when Chinese word segmentation is highly accurate. In addition, extracting the title, keywords, and description and increasing their weights also improves classification accuracy.
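The field-weighting idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the weights or the feature representation, so the values in `FIELD_WEIGHTS`, the function name `weighted_term_counts`, and the sample page are all hypothetical. The sketch also assumes the text has already been extracted and segmented into tokens, and it stops before the SVM step.

```python
from collections import Counter

# Hypothetical weights: the abstract says title, keywords, and description
# are weighted more heavily, but does not say by how much.
FIELD_WEIGHTS = {"title": 3.0, "keywords": 2.0, "description": 2.0, "body": 1.0}


def weighted_term_counts(fields, weights=FIELD_WEIGHTS):
    """Merge per-field token lists into one weighted term-frequency map.

    `fields` maps a field name to a list of already-segmented tokens.
    A field with no entry in `weights` counts with weight 1.0.
    """
    totals = Counter()
    for name, tokens in fields.items():
        w = weights.get(name, 1.0)
        for token in tokens:
            totals[token] += w
    return dict(totals)


# Hypothetical page, already split into fields and segmented into words.
page = {
    "title": ["体育", "新闻"],
    "keywords": ["足球"],
    "description": ["足球", "比赛"],
    "body": ["比赛", "结果", "足球"],
}
features = weighted_term_counts(page)
```

A term that appears in the title or metadata ends up with a larger feature value than one that appears only in the body, which is the effect the abstract attributes to the weighting. The resulting map could then be vectorized and passed to any SVM implementation.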
Pages: 160-164
Page count: 5
Related papers
50 items in total
  • [1] SVM based Chinese web page automatic classification
    Liang, JZ
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2265 - 2268
  • [2] Web page classification based on SVM
    Xue, Weimin
    Bao, Hong
    Xue, Weimin
    Huang, Weitong
    Lu, Yuchang
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 6111 - +
  • [3] Research of SVM-based document classification
    Zhang, ZhenNan
    Xu, Qian
    Cui, Junbo
    Pu, Duan
    Information, Management and Algorithms, Vol II, 2007, : 260 - 263
  • [4] A Chinese Web Page Automatic Classification System
    Huang, Rongyou
    Zhao, Xinjian
    WEB INFORMATION SYSTEMS AND MINING, 2010, 6318 : 61 - +
  • [5] SVM-based automatic classification for protein structural domain
    Shao, Xiao-Han
    Tian, Ying-Jie
    Deng, Nai-Yang
    OPTIMIZATION AND SYSTEMS BIOLOGY, 2007, 7 : 341 - +
  • [6] Research of Chinese-text automatic classification based on SVM
    Coll. of Management, Univ. of Shanghai Science and Technology, Shanghai 200093, China
    Xi Tong Cheng Yu Dian Zi Ji Shu/Syst Eng Electron, 2007, 3 (475-478):
  • [7] Research on web page automatic classification based on internet news corpus
    Cai, Wei
    Wang, Yong-Cheng
    Yin, Zhong-Hang
    Journal of Shanghai Jiaotong University (Science), 2007, 12 E (06) : 731 - 735
  • [8] A Fuzzy Ontology and SVM-Based Web Content Classification System
    Ali, Farman
    Khan, Pervez
    Riaz, Kashif
    Kwak, Daehan
    Abuhmed, Tamer
    Park, Daeyoung
    Kwak, Kyung Sup
    IEEE ACCESS, 2017, 5 : 25781 - 25797
  • [9] Research on Pattern Classification of SVM-based Gait Signal
    Yin, Jing
    ADVANCES IN APPLIED SCIENCE AND INDUSTRIAL TECHNOLOGY, PTS 1 AND 2, 2013, 798-799 : 526 - 529
  • [10] A SVM-based Parser for Chinese
    Chan, Zhimin
    Feng, Cheng
    ITESS: 2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES, PT 1, 2008, : 1150 - 1156