Dimensionality Reduction with Category Information Fusion and Non-negative Matrix Factorization for Text Categorization

被引:0
|
作者
Zheng, Wenbin [1 ,2 ]
Qian, Yuntao [1 ]
Tang, Hong [3 ,4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310003, Zhejiang, Peoples R China
[2] China Jiliang Univ, Coll Informat Engn, Hangzhou 310003, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Aeronaut & Astronaut, Hangzhou 310003, Zhejiang, Peoples R China
[4] China Jiliang Univ, Coll Metrol Technol & Engn, Hangzhou, Peoples R China
关键词
Text Categorization; Dimensionality reduction; Non-negative Matrix Factorization; Category Fusion; CLASSIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction can efficiently improve computing performance of classifiers in text categorization, and non-negative matrix factorization could map the high dimensional term space into a low dimensional semantic subspace easily. Meanwhile, the non-negative of the basis vectors could provide a meaningful explanation for the semantic subspace. However, it usually could not achieve a satisfied classification performance because it is sensitive to the noise, data missing and outlier as a linear reconstruction method. This paper proposes a novel approach in which the train text and its category information are fused and a transformation matrix that maps the term space into a semantic subspace is obtained by a basis orthogonality non-negative matrix factorization and truncation. Finally, the dimensionality can be reduced aggressively with these transformations. Experimental results show that the proposed approach remains a good classification performance in a very low dimensional case.
引用
收藏
页码:505 / +
页数:2
相关论文
共 50 条
  • [31] Non-negative Matrix Factorization on GPU
    Platos, Jan
    Gajdos, Petr
    Kroemer, Pavel
    Snasel, Vaclav
    NETWORKED DIGITAL TECHNOLOGIES, PT 1, 2010, 87 : 21 - 30
  • [32] On affine non-negative matrix factorization
    Laurberg, Hans
    Hansen, Lars Kai
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 653 - +
  • [33] Knowledge reduction in formal contexts using non-negative matrix factorization
    Kumar, Ch. Aswani
    Dias, Sergio M.
    Vieira, Newton J.
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2015, 109 : 46 - 63
  • [34] Non-negative matrix factorization based text mining: Feature extraction and classification
    Barman, P. C.
    Iqbal, Nadeem
    Lee, Soo-Young
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 703 - 712
  • [35] Aerial Image Information Extraction Based on Non-negative Matrix Factorization
    Hao Hong
    Xu Changqing
    Zhang Xinping
    Chinese Forestry Science and Technology, 2012, 11 (03) : 55 - 55
  • [36] On the Effectiveness of Non-negative Matrix Factorization for Text Open-Set Recognition
    Impedovo, Angelo
    Rizzo, Giuseppe
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT III, 2025, 2135 : 541 - 552
  • [37] Image semantic information mining algorithm by non-negative matrix factorization
    Li Yan
    Zhou Xingbo
    2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND ENGINEERING APPLICATIONS, 2013, : 345 - 348
  • [38] Non-negative matrix factorization for target recognition
    Long, Hong-Lin
    Pi, Yi-Ming
    Cao, Zong-Jie
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1425 - 1429
  • [39] On the Construction of Non-Negative Dimensionality Reduction Methods
    Sara Krause-Solberg
    Mijail Guillemard
    Armin Iske
    Sampling Theory in Signal and Image Processing, 2017, 16 (1): : 23 - 36
  • [40] Rank selection for non-negative matrix factorization
    Cai, Yun
    Gu, Hong
    Kenney, Toby
    STATISTICS IN MEDICINE, 2023, 42 (30) : 5676 - 5693