SEGMENTATION OF CHINESE TEXT FOR WEB CONTENT FILTERING

Cited by: 0
Authors
Hui, S. C. [1]
Fong, A. C. M. [2]
Hong, G. Y. [3]
Affiliations
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[2] Auckland Univ Technol, Sch Comp & Math Sci, Auckland, New Zealand
[3] Unitec Inst Technol, Dept Comp, Auckland, New Zealand
Source
2011 INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING AND TECHNOLOGY (ICMET 2011) | 2011
Keywords
Web content filtering; Text segmentation; Information processing
DOI
Not available
Chinese Library Classification number
TH [Machinery and instrument industry]
Subject classification code
0802
Abstract
We have been engaged for some years in developing an effective bilingual (English and Chinese) web content categorization engine. Because the two languages differ in nature, their processing algorithms also differ significantly. In this paper, we evaluate a number of segmentation methods for Chinese text with the express purpose of analyzing the textual content of web pages for effective web content filtering. Based on the evaluation results, a specific method is adapted for the task.
Pages: 641 / +
Page count: 2