SEGMENTATION OF CHINESE TEXT FOR WEB CONTENT FILTERING

Cited by: 0
Authors
Hui, S. C. [1]
Fong, A. C. M. [2]
Hong, G. Y. [3]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Auckland Univ Technol, Sch Comp & Math Sci, Auckland, New Zealand
[3] Unitec Inst Technol, Dept Comp, Auckland, New Zealand
Keywords
Web content filtering; Text segmentation; Information processing
DOI
Not available
Chinese Library Classification (CLC) number
TH [Machinery and Instrument Industry]
Discipline classification code
0802
Abstract
We have been developing an effective English and Chinese bilingual web content categorization engine for some years. Because the two languages differ in nature, their processing algorithms also differ significantly. In this paper, we evaluate a number of segmentation methods for Chinese text with the express purpose of analyzing textual web content for effective web content filtering. Based on the evaluation results, a specific method is adapted for the task.
Pages: 641 / +
Number of pages: 2