Dimensionality Reduction with Category Information Fusion and Non-negative Matrix Factorization for Text Categorization

被引:0
|
作者
Zheng, Wenbin [1 ,2 ]
Qian, Yuntao [1 ]
Tang, Hong [3 ,4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310003, Zhejiang, Peoples R China
[2] China Jiliang Univ, Coll Informat Engn, Hangzhou 310003, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310003, Zhejiang, Peoples R China
[4] China Jiliang Univ, Coll Metrol Technol & Engn, Hangzhou, Peoples R China
关键词
Text Categorization; Dimensionality reduction; Non-negative Matrix Factorization; Category Fusion; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction can efficiently improve computing performance of classifiers in text categorization, and non-negative matrix factorization could map the high dimensional term space into a low dimensional semantic subspace easily. Meanwhile, the non-negative of the basis vectors could provide a meaningful explanation for the semantic subspace. However, it usually could not achieve a satisfied classification performance because it is sensitive to the noise, data missing and outlier as a linear reconstruction method. This paper proposes a novel approach in which the train text and its category information are fused and a transformation matrix that maps the term space into a semantic subspace is obtained by a basis orthogonality non-negative matrix factorization and truncation. Finally, the dimensionality can be reduced aggressively with these transformations. Experimental results show that the proposed approach remains a good classification performance in a very low dimensional case.
引用
收藏
页码:505 / +
页数:2
相关论文
共 50 条
  • [1] Dimensionality reduction using non-negative matrix factorization for information retrieval
    Tsuge, S
    Shishibori, M
    Kuroiwa, S
    Kita, K
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 960 - 965
  • [2] Structure preserving non-negative matrix factorization for dimensionality reduction
    Li, Zechao
    Liu, Jing
    Lu, Hanqing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (09) : 1175 - 1189
  • [3] Dimensionality reduction of vector space information retrieval model based on non-negative matrix factorization
    Tsuge, S
    Shishibori, M
    Kuroiwa, S
    Hirai, T
    Kita, K
    KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 367 - 371
  • [4] Non-negative matrix factorization with local preservation for hyperspectral image dimensionality reduction
    Xiao, Zhiyong
    REMOTE SENSING LETTERS, 2014, 5 (09) : 793 - 802
  • [5] Dimensionality Reduction for Histogram Features Based on Supervised Non-negative Matrix Factorization
    Ambai, Mitsuru
    Utama, Nugraha P.
    Yoshida, Yuichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10) : 1870 - 1879
  • [6] Image fusion based on non-negative matrix factorization
    Zhang, JY
    Wei, L
    Miao, QG
    Wang, Y
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 973 - 976
  • [7] Dimensionality reduction by combining category information and latent semantic index for text categorization
    Zheng, Wenbin
    An, Lixin
    Xu, Zhanyi
    Journal of Information and Computational Science, 2013, 10 (08): : 2463 - 2469
  • [8] A constrained non-negative matrix factorization in information retrieval
    Xu, BW
    Lu, JJ
    Huang, GS
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2003, : 273 - 277
  • [9] A neural network for determination of latent dimensionality in non-negative matrix factorization
    Nebgen, Benjamin T.
    Vangara, Raviteja
    Hombrados-Herrera, Miguel A.
    Kuksova, Svetlana
    Alexandrov, Boian S.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2021, 2 (02):
  • [10] Knowledge Extraction with Non-Negative Matrix Factorization for Text Classification
    Silva, Catarina
    Ribeiro, Bernardete
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, PROCEEDINGS, 2009, 5788 : 300 - +