An efficient discriminant analysis algorithm for document classification

被引:3
|
作者
Wang Z. [1 ]
Sun X. [1 ]
机构
[1] Henan University of Technology, Zhengzhou
关键词
Data mining; Dimensionality reduction; Document classification; Kernel discriminant analysis;
D O I
10.4304/jsw.6.7.1265-1272
中图分类号
学科分类号
摘要
Document categorization has become one of the most important research areas of pattern recognition and data mining due to the exponential growth of documents in the Internet and the emergent need to organize them. The document space is always of very high dimensionality and learning in such a high dimensional space is often impossible due to the curse of dimensionality. To cope with performance and accuracy problems with high dimensionality, a novel dimensionality reduction algorithm called IKDA is proposed in this paper. The proposed IKDA algorithm combines kernel-based learning techniques and direct iterative optimization procedure to deal with the nonlinearity of the document distribution. The proposed algorithm also effectively solves the so-called "small sample size" problem in document classification task. Extensive experimental results on two real world data sets demonstrate the effectiveness and efficiency of the proposed algorithm. © 2011 ACADEMY PUBLISHER.
引用
收藏
页码:1265 / 1272
页数:7
相关论文
共 50 条
  • [31] Classification of apoptosis proteins by discriminant analysis
    Kandemir-Cavas, Cagin
    Nasibov, Efendi
    TURKISH JOURNAL OF BIOCHEMISTRY-TURK BIYOKIMYA DERGISI, 2012, 37 (01): : 54 - 61
  • [32] PROBABILITIES OF CORRECT CLASSIFICATION IN DISCRIMINANT ANALYSIS
    DUNN, OJ
    VARADY, PD
    BIOMETRICS, 1966, 22 (04) : 908 - &
  • [33] Parameterized discriminant analysis for image classification
    Tian, Q
    Yu, J
    Rui, T
    Huang, TS
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 5 - 8
  • [34] DISCRIMINANT ANALYSIS AND CLASSIFICATION OF MUTUAL FUNDS
    LECLAIR, RT
    JOURNAL OF ECONOMICS AND BUSINESS, 1974, 26 (03) : 220 - 224
  • [35] Multiscale Slant Discriminant Analysis for Classification
    Jiao, QingLiang
    MODERN ADVANCES IN APPLIED INTELLIGENCE, IEA/AIE 2014, PT I, 2014, 8481 : 273 - 281
  • [36] Discriminant Analysis for Radar Signal Classification
    Guo, Shanzeng
    Tracey, Hannah
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2020, 56 (04) : 3134 - 3148
  • [37] Efficient document clustering algorithm and its application to a document browser
    Tanaka, Hideki
    Kumano, Tadashi
    Uratani, Noriyoshi
    Ehara, Terumasa
    Information Processing and Management, 1999, 35 (04): : 541 - 557
  • [38] An efficient document clustering algorithm and its application to a document browser
    Tanaka, H
    Kumano, T
    Uratani, N
    Ehara, T
    INFORMATION PROCESSING & MANAGEMENT, 1999, 35 (04) : 541 - 557
  • [39] G-KNN: An Efficient Document Classification Algorithm for Sparse Datasets on GPUs using KNN
    Rocha, Leonardo
    Ramos, Gabriel
    Chaves, Rodrigo
    Sachetto, Rafael
    Madeira, Daniel
    Viegas, Felipe
    Andrade, Guilherme
    Daniel, Sergio
    Goncalves, Marcos
    Ferreira, Renato
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 1335 - 1338
  • [40] Efficient implementation of associative classifiers for document classification
    Yoon, Yongwook
    Lee, Gary Geunbae
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (02) : 393 - 405