A nonnegative matrix factorization framework for semi-supervised document clustering with dual constraints

被引:10
|
作者
Ma, Huifang [1 ]
Zhao, Weizhong [2 ]
Shi, Zhongzhi [3 ]
机构
[1] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Gansu, Peoples R China
[2] Xiangtan Univ, Coll Informat Engn, Xiangtan 411105, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Nonnegative matrix factorization; Semi-supervised clustering; Dual constraints; Pair-wise constraints; Word-level constraints; ALGORITHMS;
D O I
10.1007/s10115-012-0560-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new semi-supervised co-clustering algorithm Orthogonal Semi-Supervised Nonnegative Matrix Factorization (OSS-NMF) for document clustering. In this new approach, the clustering process is carried out by incorporating both prior domain knowledge of data points (documents) in the form of pair-wise constraints and category knowledge of features (words) into the NMF co-clustering framework. Under this framework, the clustering problem is formulated as the problem of finding the local minimizer of objective function, taking into account the dual prior knowledge. The update rules are derived, and an iterative algorithm is designed for the co-clustering process. Theoretically, we prove the correctness and convergence of our algorithm and demonstrate its mathematical rigorous. Our experimental evaluations show that the proposed document clustering model presents remarkable performance improvements with those constraints.
引用
收藏
页码:629 / 651
页数:23
相关论文
共 50 条
  • [41] Constrained nonnegative matrix factorization-based semi-supervised multilabel learning
    Dingguo Yu
    Bin Fu
    Guandong Xu
    Aihong Qin
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 1093 - 1100
  • [42] Robust Semi-Supervised Community Detection Based on Symmetric Nonnegative Matrix Factorization
    Xie, Wenyun
    Peng, Siyuan
    Yang, Zhijing
    2024 5th International Conference on Computer Engineering and Intelligent Control, ICCEIC 2024, 2024, : 55 - 61
  • [43] Semi-supervised pivotal-aware nonnegative matrix factorization with label and pairwise constraint propagation for data clustering
    Yang, Xiaojun
    Zhu, Tuoji
    Peng, Siyuan
    Nie, Feiping
    Lin, Zhiping
    PATTERN RECOGNITION, 2025, 157
  • [44] Non-negative matrix factorization for semi-supervised data clustering
    Chen, Yanhua
    Rege, Manjeet
    Dong, Ming
    Hua, Jing
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (03) : 355 - 379
  • [45] Non-negative matrix factorization for semi-supervised data clustering
    Yanhua Chen
    Manjeet Rege
    Ming Dong
    Jing Hua
    Knowledge and Information Systems, 2008, 17 : 355 - 379
  • [46] Semi-supervised Microblog Clustering Method via Dual Constraints
    Ma, Huifang
    Jia, Meihuizi
    Zhao, Weizhong
    Lin, Xianghong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2015, 2015, 9403 : 360 - 369
  • [47] Semi-supervised document clustering via active learning with pairwise constraints
    Huang, Ruizhang
    Lam, Wai
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 517 - 522
  • [48] TRANSDUCTIVE NONNEGATIVE MATRIX FACTORIZATION FOR SEMI-SUPERVISED HIGH-PERFORMANCE SPEECH SEPARATION
    Guan, Naiyang
    Lan, Long
    Tao, Dacheng
    Luo, Zhigang
    Yang, Xuejun
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [49] Simultaneous fault detection and isolation using semi-supervised kernel nonnegative matrix factorization
    Zhai, Lirong
    Jia, Qilong
    CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2019, 97 (12): : 3025 - 3034
  • [50] Semi-supervised graph regularized nonnegative matrix factorization with local coordinate for image representation
    Li, Huirong
    Gao, Yuelin
    Liu, Junmin
    Zhang, Jiangshe
    Li, Chao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 102