An efficient discriminant analysis algorithm for document classification

被引:3
|
作者
Wang Z. [1 ]
Sun X. [1 ]
机构
[1] Henan University of Technology, Zhengzhou
关键词
Data mining; Dimensionality reduction; Document classification; Kernel discriminant analysis;
D O I
10.4304/jsw.6.7.1265-1272
中图分类号
学科分类号
摘要
Document categorization has become one of the most important research areas of pattern recognition and data mining due to the exponential growth of documents in the Internet and the emergent need to organize them. The document space is always of very high dimensionality and learning in such a high dimensional space is often impossible due to the curse of dimensionality. To cope with performance and accuracy problems with high dimensionality, a novel dimensionality reduction algorithm called IKDA is proposed in this paper. The proposed IKDA algorithm combines kernel-based learning techniques and direct iterative optimization procedure to deal with the nonlinearity of the document distribution. The proposed algorithm also effectively solves the so-called "small sample size" problem in document classification task. Extensive experimental results on two real world data sets demonstrate the effectiveness and efficiency of the proposed algorithm. © 2011 ACADEMY PUBLISHER.
引用
收藏
页码:1265 / 1272
页数:7
相关论文
共 50 条
  • [1] Kernel Discriminant Analysis Algorithm for Document Categorization
    Wang, Ziqiang
    Qian, Xu
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 601 - 604
  • [2] Efficient algorithm for kernel discriminant analysis
    Liang, Z
    Shi, P
    ELECTRONICS LETTERS, 2004, 40 (25) : 1579 - 1581
  • [3] A robust and efficient algorithm for bilevel document block classification
    Pappas, TN
    Tseng, SH
    Kosiba, DA
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 1122 - 1125
  • [4] An Efficient LDE-Based Document Classification Algorithm
    Wang, Ziqiang
    Sun, Xia
    PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 571 - 574
  • [5] An Efficient Document Classification Algorithm Based on Kernel LDE
    Sun, Xia
    Zhang, Qingzhou
    Wang, Ziqiang
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL MECHATRONICS AND AUTOMATION, 2009, : 509 - 511
  • [6] A Flexible and Efficient Algorithm for Regularized Fisher Discriminant Analysis
    Zhang, Zhihua
    Dai, Guang
    Jordan, Michael I.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 632 - +
  • [7] Orthogonal Locality Discriminant Embedding for Document Classification
    Wang, Ziqiang
    Sun, Xia
    2009 FOURTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PROCEEDINGS, 2009, : 170 - 174
  • [8] An Efficient Web Document Classification Algorithm Based on LPP and SVM
    Wang, Ziqiang
    Liu, Yuxun
    Sun, Xia
    PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 437 - 440
  • [9] Efficient and Robust Sparse Linear Discriminant Analysis for Data Classification
    Liu, Jingjing
    Feng, Manlong
    Xiu, Xianchao
    Liu, Wanquan
    Zeng, Xiaoyang
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 617 - 629
  • [10] A CLASSIFICATION ALGORITHM WITH LINEAR DISCRIMINANT ANALYSIS AND AXIOMATIC FUZZY SETS
    Jia, Wenjuan
    Deng, Yingjie
    Xin, Chenyang
    Liu, Xiaodong
    Pedrycz, Witold
    MATHEMATICAL FOUNDATIONS OF COMPUTING, 2019, 2 (01): : 73 - 81