A nonnegative matrix factorization framework for semi-supervised document clustering with dual constraints

被引:10
|
作者
Ma, Huifang [1 ]
Zhao, Weizhong [2 ]
Shi, Zhongzhi [3 ]
机构
[1] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Gansu, Peoples R China
[2] Xiangtan Univ, Coll Informat Engn, Xiangtan 411105, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Nonnegative matrix factorization; Semi-supervised clustering; Dual constraints; Pair-wise constraints; Word-level constraints; ALGORITHMS;
D O I
10.1007/s10115-012-0560-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new semi-supervised co-clustering algorithm Orthogonal Semi-Supervised Nonnegative Matrix Factorization (OSS-NMF) for document clustering. In this new approach, the clustering process is carried out by incorporating both prior domain knowledge of data points (documents) in the form of pair-wise constraints and category knowledge of features (words) into the NMF co-clustering framework. Under this framework, the clustering problem is formulated as the problem of finding the local minimizer of objective function, taking into account the dual prior knowledge. The update rules are derived, and an iterative algorithm is designed for the co-clustering process. Theoretically, we prove the correctness and convergence of our algorithm and demonstrate its mathematical rigorous. Our experimental evaluations show that the proposed document clustering model presents remarkable performance improvements with those constraints.
引用
收藏
页码:629 / 651
页数:23
相关论文
共 50 条
  • [31] Semi-supervised hyperspectral unmixing approach based on nonnegative matrix factorization
    Zhang, Lifu
    Wang, Nan
    Zhang, Xia
    Chen, Zhengfu
    Gao, Min
    REMOTELY SENSED DATA COMPRESSION, COMMUNICATIONS, AND PROCESSING XII, 2016, 9874
  • [32] Semi-supervised nonnegative matrix factorization with positive and negative label propagations
    Changpeng Wang
    Jiangshe Zhang
    Tianjun Wu
    Meng Zhang
    Guang Shi
    Applied Intelligence, 2022, 52 : 9739 - 9750
  • [33] Network Embedding Using Semi-Supervised Kernel Nonnegative Matrix Factorization
    He, Chaobo
    Zhang, Qiong
    Tang, Yong
    Liu, Shuangyin
    Liu, Hai
    IEEE ACCESS, 2019, 7 : 92732 - 92744
  • [34] Semi-Supervised Multi-view clustering based on orthonormality-constrained nonnegative matrix factorization
    Cai, Hao
    Liu, Bo
    Xiao, Yanshan
    Lin, Luyue
    INFORMATION SCIENCES, 2020, 536 : 171 - 184
  • [35] Document clustering using nonnegative matrix factorization/
    Shahnaz, F
    Berry, MW
    Pauca, VP
    Plemmons, RJ
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (02) : 373 - 386
  • [36] Deep matrix factorization with knowledge transfer for lifelong clustering and semi-supervised clustering
    Zhang, Yiling
    Wang, Hao
    Yang, Yan
    Zhou, Wei
    Li, Tianrui
    Ouyang, Xiaocao
    Chen, Hongyang
    INFORMATION SCIENCES, 2021, 570 : 795 - 814
  • [37] Nonnegative Matrix Factorization for Document Clustering: A Survey
    Hosseini-Asl, Ehsan
    Zurada, Jacek M.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2014, PT II, 2014, 8468 : 726 - 737
  • [38] Community detection method based on robust semi-supervised nonnegative matrix factorization
    He, Chaobo
    Zhang, Qiong
    Tang, Yong
    Liu, Shuangyin
    Zheng, Jianhua
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 523 : 279 - 291
  • [39] Constrained nonnegative matrix factorization-based semi-supervised multilabel learning
    Yu, Dingguo
    Fu, Bin
    Xu, Guandong
    Qin, Aihong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (05) : 1093 - 1100
  • [40] Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: A case study
    Gaujoux, Renaud
    Seoighe, Cathal
    INFECTION GENETICS AND EVOLUTION, 2012, 12 (05) : 913 - 921