Continuous Semi-Supervised Nonnegative Matrix Factorization

被引:2
|
作者
Lindstrom, Michael R. R. [1 ]
Ding, Xiaofu [2 ]
Liu, Feng [2 ]
Somayajula, Anand [2 ]
Needell, Deanna [2 ]
机构
[1] Univ Texas Rio Grande Valley, Sch Math & Stat Sci, Edinburg, TX 78539 USA
[2] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90095 USA
关键词
topic modelling; regression; nonnegative matrix factorization; optimization; CONSTRAINED LEAST-SQUARES; ALGORITHMS;
D O I
10.3390/a16040187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nonnegative matrix factorization can be used to automatically detect topics within a corpus in an unsupervised fashion. The technique amounts to an approximation of a nonnegative matrix as the product of two nonnegative matrices of lower rank. In certain applications it is desirable to extract topics and use them to predict quantitative outcomes. In this paper, we show Nonnegative Matrix Factorization can be combined with regression on a continuous response variable by minimizing a penalty function that adds a weighted regression error to a matrix factorization error. We show theoretically that as the weighting increases, the regression error in training decreases weakly. We test our method on synthetic data and real data coming from Rate My Professors reviews to predict an instructor's rating from the text in their reviews. In practice, when used as a dimensionality reduction method (when the number of topics chosen in the model is fewer than the true number of topics), the method performs better than doing regression after topics are identified-both during training and testing-and it retrains interpretability.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Orthogonal Nonnegative Matrix Tri-factorization for Semi-supervised Document Co-clustering
    Ma, Huifang
    Zhao, Weizhong
    Tan, Qing
    Shi, Zhongzhi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II, PROCEEDINGS, 2010, 6119 : 189 - +
  • [42] Discriminatively Constrained Semi-Supervised Multi-View Nonnegative Matrix Factorization with Graph Regularization
    Cui, Guosheng
    Li, Ye
    Li, Jianzhong
    Fan, Jianping
    BIG DATA MINING AND ANALYTICS, 2024, 7 (01): : 55 - 74
  • [43] Robust semi-supervised data representation and imputation by correntropy based constraint nonnegative matrix factorization
    Nan Zhou
    Yuanhua Du
    Jun Liu
    Xiuyu Huang
    Xiao Shen
    Kup-Sze Choi
    Applied Intelligence, 2023, 53 : 11599 - 11617
  • [44] Deep nonsmooth nonnegative matrix factorization network with semi-supervised learning for SAR image change detection
    Li, Heng-Chao
    Yang, Gang
    Yang, Wen
    Du, Qian
    Emery, William J.
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 160 (160) : 167 - 179
  • [45] Semi-Supervised Multi-view Multi-label Classification Based on Nonnegative Matrix Factorization
    Wang, Guangxia
    Zhang, Changqing
    Zhu, Pengfei
    Hu, Qinghua
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 340 - 348
  • [46] Multiple Graph Adaptive Regularized Semi-Supervised Nonnegative Matrix Factorization with Sparse Constraint for Data Representation
    Zhang, Kexin
    Li, Lingling
    Di, Jinhong
    Wang, Yi
    Zhao, Xuezhuan
    Zhang, Ji
    PROCESSES, 2022, 10 (12)
  • [47] Semi-Supervised Nonnegative Matrix Factorization of Wide-Field Fluorescence Microscopic Images for Tissue Diagnosis
    Soman, Shania M.
    Rekha, Charuvil Radhakrishna Pillai
    Santhakumar, Hema
    Narendrakumar, Uttamchand
    Jayasree, Ramapurath S.
    MICROSCOPY AND MICROANALYSIS, 2020, 26 (03) : 419 - 428
  • [48] Semi-Supervised Multi-view clustering based on orthonormality-constrained nonnegative matrix factorization
    Cai, Hao
    Liu, Bo
    Xiao, Yanshan
    Lin, Luyue
    INFORMATION SCIENCES, 2020, 536 : 171 - 184
  • [49] Real-Time Independent Vector Analysis Using Semi-Supervised Nonnegative Matrix Factorization as a Source Model
    Wang, Taihui
    Yang, Feiran
    Zhu, Rui
    Yang, Jun
    INTERSPEECH 2021, 2021, : 1842 - 1846
  • [50] Semi-supervised pivotal-aware nonnegative matrix factorization with label and pairwise constraint propagation for data clustering
    Yang, Xiaojun
    Zhu, Tuoji
    Peng, Siyuan
    Nie, Feiping
    Lin, Zhiping
    PATTERN RECOGNITION, 2025, 157