An efficient discriminant analysis algorithm for document classification

Cited by: 3
Authors
Wang Z. [1 ]
Sun X. [1 ]
Affiliation
[1] Henan University of Technology, Zhengzhou
Keywords
Data mining; Dimensionality reduction; Document classification; Kernel discriminant analysis
DOI
10.4304/jsw.6.7.1265-1272
Abstract
Document categorization has become one of the most important research areas in pattern recognition and data mining because of the exponential growth of documents on the Internet and the urgent need to organize them. The document space is typically of very high dimensionality, and learning in such a space is often impossible because of the curse of dimensionality. To address the performance and accuracy problems caused by high dimensionality, this paper proposes a novel dimensionality reduction algorithm called IKDA. IKDA combines kernel-based learning techniques with a direct iterative optimization procedure to handle the nonlinearity of the document distribution. It also effectively solves the so-called "small sample size" problem in document classification tasks. Extensive experiments on two real-world data sets demonstrate the effectiveness and efficiency of the proposed algorithm. © 2011 ACADEMY PUBLISHER.
Pages: 1265-1272
Page count: 7
Related papers
50 in total
  • [21] On sparse linear discriminant analysis algorithm for high-dimensional data classification
    Ng, Michael K.
    Liao, Li-Zhi
    Zhang, Leihong
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2011, 18 (02) : 223 - 235
  • [22] Spatially-Regularized Manifold Discriminant Analysis Algorithm for Hyperspectral Image Classification
    Huang Hong
    Wang Lihua
    Shi Guangyao
    ACTA OPTICA SINICA, 2020, 40 (02)
  • [23] TopicBERT for Energy Efficient Document Classification
    Chaudhary, Yatin
    Gupta, Pankaj
    Saxena, Khushbu
    Kulkarni, Vivek
    Runkler, Thomas
    Schutze, Hinrich
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1682 - 1690
  • [24] AN EFFICIENT GREEDY SEARCH ALGORITHM FOR HIGH-DIMENSIONAL LINEAR DISCRIMINANT ANALYSIS
    Yang, Hannan
    Lin, Danyu
    Li, Quefeng
    STATISTICA SINICA, 2023, 33 : 1343 - 1364
  • [25] A genetic algorithm for discriminant analysis
    Daniel G. Conway
    A. Victor Cabot
    M.A. Venkataramanan
    Annals of Operations Research, 1998, 78 (0) : 71 - 82
  • [26] An algorithm for nonmetric discriminant analysis
    Choulakian, V
    Almhana, J
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 35 (03) : 253 - 264
  • [27] Bat algorithm for variable selection in multivariate classification modeling using linear discriminant analysis
    Souza, Juliana da Cruz
    Soares, Sofacles F. C.
    de Paula, Lauro Cassio M.
    Coelho, Clarimar J.
    de Araujo, Mario Cesar Ugulino
    da Silva, Edvan Cirino
    MICROCHEMICAL JOURNAL, 2023, 187
  • [28] Adaptive Linear Discriminant Analysis Algorithm Applied to Motion Signal Classification in EEG Processing
    Xu, Rui
    Tang, Haoyue
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INFORMATION SCIENCES, MACHINERY, MATERIALS AND ENERGY (ICISMME 2015), 2015, 126 : 413 - 418
  • [29] Multi-discriminant classification algorithm for face verification
    Huang, Cheng-Ho
    Wang, Jhing-Fa
    VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2008, : 299 - 304
  • [30] A nonlinear discriminant algorithm for feature extraction and data classification
    Cruz, CS
    Dorronsoro, JR
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (06) : 1370 - 1376