A nonnegative matrix factorization framework for semi-supervised document clustering with dual constraints

被引:10
|
作者
Ma, Huifang [1 ]
Zhao, Weizhong [2 ]
Shi, Zhongzhi [3 ]
机构
[1] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Gansu, Peoples R China
[2] Xiangtan Univ, Coll Informat Engn, Xiangtan 411105, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Nonnegative matrix factorization; Semi-supervised clustering; Dual constraints; Pair-wise constraints; Word-level constraints; ALGORITHMS;
D O I
10.1007/s10115-012-0560-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new semi-supervised co-clustering algorithm Orthogonal Semi-Supervised Nonnegative Matrix Factorization (OSS-NMF) for document clustering. In this new approach, the clustering process is carried out by incorporating both prior domain knowledge of data points (documents) in the form of pair-wise constraints and category knowledge of features (words) into the NMF co-clustering framework. Under this framework, the clustering problem is formulated as the problem of finding the local minimizer of objective function, taking into account the dual prior knowledge. The update rules are derived, and an iterative algorithm is designed for the co-clustering process. Theoretically, we prove the correctness and convergence of our algorithm and demonstrate its mathematical rigorous. Our experimental evaluations show that the proposed document clustering model presents remarkable performance improvements with those constraints.
引用
收藏
页码:629 / 651
页数:23
相关论文
共 50 条
  • [11] Dual semi-supervised convex nonnegative matrix factorization for data representation
    Peng, Siyuan
    Yang, Zhijing
    Ling, Bingo Wing-Kuen
    Chen, Badong
    Lin, Zhiping
    INFORMATION SCIENCES, 2022, 585 : 571 - 593
  • [12] Semi-supervised concept factorization for document clustering
    Lu, Mei
    Zhao, Xiang-Jun
    Zhang, Li
    Li, Fan-Zhang
    INFORMATION SCIENCES, 2016, 331 : 86 - 98
  • [13] Solving consensus and semi-supervised clustering problems using nonnegative matrix factorization
    Li, Tao
    Ding, Chris
    Jordan, Michael I.
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 577 - +
  • [14] Semi-supervised collective matrix factorization for topic detection and document clustering
    Wang, Ye
    Zhang, Yanchun
    Zhou, Bin
    Jia, Yan
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 88 - 97
  • [15] Hypergraph based semi-supervised symmetric nonnegative matrix factorization for image clustering
    Yin, Jingxing
    Peng, Siyuan
    Yang, Zhijing
    Chen, Badong
    Lin, Zhiping
    PATTERN RECOGNITION, 2023, 137
  • [16] Semi-supervised Nonnegative Matrix Factorization for Microblog Clustering Based on Term Correlation
    Ma, Huifang
    Jia, Meihuizi
    Shi, Yakai
    Hao, Zhanjun
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 511 - 516
  • [17] Semi-supervised Nonnegative Matrix Factorization with Commonness Extraction
    Teng, Yueyang
    Qi, Shouliang
    Dai, Yin
    Xu, Lisheng
    Qian, Wei
    Kang, Yan
    NEURAL PROCESSING LETTERS, 2017, 45 (03) : 1063 - 1076
  • [18] Semi-supervised Nonnegative Matrix Factorization with Commonness Extraction
    Yueyang Teng
    Shouliang Qi
    Yin Dai
    Lisheng Xu
    Wei Qian
    Yan Kang
    Neural Processing Letters, 2017, 45 : 1063 - 1076
  • [19] Semi-supervised multi-view clustering based on constrained nonnegative matrix factorization
    Cai, Hao
    Liu, Bo
    Xiao, Yanshan
    Lin, LuYue
    KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [20] Multiple graph regularized semi-supervised nonnegative matrix factorization with adaptive weights for clustering
    Zhang, Kexin
    Zhao, Xuezhuan
    Peng, Siyuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106