Examples Initialization in Chinese Text Categorization

被引:0
|
作者
Cheng, Shi [1 ,2 ]
Shi, Yuhui [2 ]
Qin, Quande [3 ]
机构
[1] Univ Liverpool, Dept Elect Engn & Elect, Liverpool, Merseyside, England
[2] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou, Peoples R China
[3] Shenzhen Univ, Coll Management, Shenzhen, Peoples R China
关键词
PARTICLE SWARM OPTIMIZATION; POPULATION DIVERSITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The generalization ability is a fundamental goal for a classifier in machine learning. The categorization results are influenced by the initialized examples in a nearest neighbor classifier. The generalization ability beyond the examples in training set is important in categorization. In this paper, we propose a particle swarm optimization with k means clustering algorithm for the nearest neighbor classifier's examples initialization to improve categorization performances. This classifier utilizes an iterative strategy, and the classifier's example initialization is based on clusters center and random examples. The new classifier is tested on a Chinese text corpus. The proposed classifier is compared against the nearest neighbor classifier with random initialization.
引用
收藏
页码:967 / 971
页数:5
相关论文
共 50 条
  • [1] Improving linear classifier for Chinese text categorization
    Tsay, JJ
    Wang, JD
    INFORMATION PROCESSING & MANAGEMENT, 2004, 40 (02) : 223 - 237
  • [2] Distributional character clustering for chinese text categorization
    Zhou, XZ
    Wu, ZH
    PRICAI 2004: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3157 : 575 - 584
  • [3] Learning effective features for Chinese text categorization
    Luo, DS
    Wang, XH
    Wu, XH
    Chi, HS
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 608 - 613
  • [4] Improving Chinese text categorization by outlier learning
    Wang, XH
    Luo, DS
    Wu, XH
    Chi, HS
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 602 - 607
  • [5] Chinese text categorization based on CCIPCA and SMO
    Li, Xin-Fu
    He, Hai-Bin
    Zhao, Lei-Lei
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2514 - 2518
  • [6] A study on feature weighting in Chinese text categorization
    Xue, DJ
    Sun, MS
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 592 - 601
  • [7] Experimental study on representing units in Chinese text categorization
    Li, BL
    Chen, YZ
    Bai, XJ
    Yu, SW
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 602 - 614
  • [8] Using maximum entropy model for Chinese text categorization
    Li, RL
    Tao, XP
    Tang, L
    Hu, YF
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 578 - 587
  • [9] A high performance prototype system for Chinese text categorization
    Fan, Xinghua
    MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 1017 - 1026
  • [10] Chinese text categorization based on alternative covering algorithm
    Key Laboratory of Intelligent Computing and Signal Processing Ministry of Education, Anhui University, Hefei 230039, China
    不详
    Jisuanji Gongcheng, 2006, 19 (183-184):