An efficient discriminant analysis algorithm for document classification

Cited by: 3
Authors
Wang Z. [1 ]
Sun X. [1 ]
Affiliation
[1] Henan University of Technology, Zhengzhou
Keywords
Data mining; Dimensionality reduction; Document classification; Kernel discriminant analysis
DOI
10.4304/jsw.6.7.1265-1272
Abstract
Document categorization has become one of the most important research areas in pattern recognition and data mining, owing to the exponential growth of documents on the Internet and the pressing need to organize them. The document space is typically of very high dimensionality, and learning in such a space is often infeasible because of the curse of dimensionality. To address the performance and accuracy problems that high dimensionality causes, this paper proposes a novel dimensionality reduction algorithm called IKDA. IKDA combines kernel-based learning techniques with a direct iterative optimization procedure to handle the nonlinearity of the document distribution. It also effectively solves the so-called "small sample size" problem in document classification tasks. Extensive experiments on two real-world data sets demonstrate the effectiveness and efficiency of the proposed algorithm. © 2011 ACADEMY PUBLISHER.
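The abstract does not spell out the IKDA procedure, so the following is not the authors' algorithm. It is a minimal sketch of standard regularized kernel discriminant analysis, the family of methods the abstract builds on. It assumes an RBF kernel, and it uses a small ridge term as one common way to handle the singular within-class scatter behind the "small sample size" problem. The function names (`rbf_kernel`, `kda_fit`, `kda_transform`), the parameters `gamma` and `reg`, and the toy two-ring data are all hypothetical, chosen for illustration only.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    # Pairwise squared Euclidean distances between rows of A and rows of B.
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def kda_fit(X, y, gamma=0.5, reg=1e-3):
    """Return expansion coefficients, shape (n_samples, n_classes - 1)."""
    K = rbf_kernel(X, X, gamma)
    n = K.shape[0]
    classes = np.unique(y)
    m_all = K.mean(axis=1)
    M = np.zeros((n, n))  # between-class scatter in kernel coordinates
    N = np.zeros((n, n))  # within-class scatter in kernel coordinates
    for c in classes:
        Kc = K[:, y == c]                      # kernel columns of class c
        d = Kc.mean(axis=1) - m_all
        M += Kc.shape[1] * np.outer(d, d)
        centered = Kc - Kc.mean(axis=1, keepdims=True)
        N += centered @ centered.T
    # Assumption: a ridge term makes the singular within-class scatter invertible.
    N += reg * np.eye(n)
    # Solve M a = lambda N a by whitening with the Cholesky factor of N.
    L = np.linalg.cholesky(N)
    S = np.linalg.solve(L, np.linalg.solve(L, M).T)
    S = (S + S.T) / 2.0                        # remove round-off asymmetry
    _, V = np.linalg.eigh(S)                   # eigenvalues in ascending order
    top = V[:, ::-1][:, : len(classes) - 1]    # leading eigenvectors
    return np.linalg.solve(L.T, top)

def kda_transform(X_new, X_train, alpha, gamma=0.5):
    # Project points onto the discriminant directions.
    return rbf_kernel(X_new, X_train, gamma) @ alpha

# Toy data: two concentric rings, which no linear projection can separate.
rng = np.random.default_rng(0)
theta = rng.uniform(0.0, 2.0 * np.pi, 80)
radius = np.where(np.arange(80) < 40, 1.0, 3.0) + 0.1 * rng.normal(size=80)
X = np.column_stack([radius * np.cos(theta), radius * np.sin(theta)])
y = (np.arange(80) >= 40).astype(int)

alpha = kda_fit(X, y)
Z = kda_transform(X, X, alpha)                 # 1-D embedding of the 2-D points

# Nearest-class-mean classification in the reduced space.
means = np.array([Z[y == c].mean(axis=0) for c in (0, 1)])
pred = np.argmin(np.abs(Z - means.T), axis=1)
accuracy = (pred == y).mean()
```

For document data, `X` would hold term-weight vectors rather than 2-D points, and the reduced representation `Z` would feed whatever classifier is used downstream.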
Pages: 1265-1272
Number of pages: 7