Continuous Semi-Supervised Nonnegative Matrix Factorization

被引:2
|
作者
Lindstrom, Michael R. R. [1 ]
Ding, Xiaofu [2 ]
Liu, Feng [2 ]
Somayajula, Anand [2 ]
Needell, Deanna [2 ]
机构
[1] Univ Texas Rio Grande Valley, Sch Math & Stat Sci, Edinburg, TX 78539 USA
[2] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
关键词
topic modelling; regression; nonnegative matrix factorization; optimization; CONSTRAINED LEAST-SQUARES; ALGORITHMS;
D O I
10.3390/a16040187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nonnegative matrix factorization can be used to automatically detect topics within a corpus in an unsupervised fashion. The technique amounts to an approximation of a nonnegative matrix as the product of two nonnegative matrices of lower rank. In certain applications it is desirable to extract topics and use them to predict quantitative outcomes. In this paper, we show Nonnegative Matrix Factorization can be combined with regression on a continuous response variable by minimizing a penalty function that adds a weighted regression error to a matrix factorization error. We show theoretically that as the weighting increases, the regression error in training decreases weakly. We test our method on synthetic data and real data coming from Rate My Professors reviews to predict an instructor's rating from the text in their reviews. In practice, when used as a dimensionality reduction method (when the number of topics chosen in the model is fewer than the true number of topics), the method performs better than doing regression after topics are identified-both during training and testing-and it retrains interpretability.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] A nonnegative matrix factorization framework for semi-supervised document clustering with dual constraints
    Ma, Huifang
    Zhao, Weizhong
    Shi, Zhongzhi
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 36 (03) : 629 - 651
  • [22] Constrained nonnegative matrix factorization-based semi-supervised multilabel learning
    Yu, Dingguo
    Fu, Bin
    Xu, Guandong
    Qin, Aihong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (05) : 1093 - 1100
  • [23] A nonnegative matrix factorization framework for semi-supervised document clustering with dual constraints
    Huifang Ma
    Weizhong Zhao
    Zhongzhi Shi
    Knowledge and Information Systems, 2013, 36 : 629 - 651
  • [24] Semi-supervised Nonnegative Matrix Factorization for gene expression deconvolution: A case study
    Gaujoux, Renaud
    Seoighe, Cathal
    INFECTION GENETICS AND EVOLUTION, 2012, 12 (05) : 913 - 921
  • [25] Constrained nonnegative matrix factorization-based semi-supervised multilabel learning
    Dingguo Yu
    Bin Fu
    Guandong Xu
    Aihong Qin
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 1093 - 1100
  • [26] Robust Semi-Supervised Community Detection Based on Symmetric Nonnegative Matrix Factorization
    Xie, Wenyun
    Peng, Siyuan
    Yang, Zhijing
    2024 5th International Conference on Computer Engineering and Intelligent Control, ICCEIC 2024, 2024, : 55 - 61
  • [27] Semi-supervised Nonnegative Matrix Factorization for Microblog Clustering Based on Term Correlation
    Ma, Huifang
    Jia, Meihuizi
    Shi, Yakai
    Hao, Zhanjun
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 511 - 516
  • [28] Semi-supervised multi-view clustering based on constrained nonnegative matrix factorization
    Cai, Hao
    Liu, Bo
    Xiao, Yanshan
    Lin, LuYue
    KNOWLEDGE-BASED SYSTEMS, 2019, 182
  • [29] Multiple graph regularized semi-supervised nonnegative matrix factorization with adaptive weights for clustering
    Zhang, Kexin
    Zhao, Xuezhuan
    Peng, Siyuan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [30] Multiview Clustering via Hypergraph Induced Semi-Supervised Symmetric Nonnegative Matrix Factorization
    Peng, Siyuan
    Yin, Jingxing
    Yang, Zhijing
    Chen, Badong
    Lin, Zhiping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5510 - 5524