Nonnegative Matrix Factorization Via Archetypal Analysis

被引:7
|
作者
Javadi, Hamid [1 ]
Montanari, Andrea [2 ]
机构
[1] Rice Univ, Dept Elect & Comp Engn, POB 1892, Houston, TX 77005 USA
[2] Stanford Univ, Dept Elect Engn & Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Dimensionality reduction; Matrix factorization; Separability; ALGORITHMS;
D O I
10.1080/01621459.2019.1594832
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Given a collection of data points, nonnegative matrix factorization (NMF) suggests expressing them as convex combinations of a small set of "archetypes" with nonnegative entries. This decomposition is unique only if the true archetypes are nonnegative and sufficiently sparse (or the weights are sufficiently sparse), a regime that is captured by the separability condition and its generalizations. In this article, we study an approach to NMF that can be traced back to the work of Cutler and Breiman [(1994), "Archetypal Analysis," Technometrics, 36, 338-347] and does not require the data to be separable, while providing a generally unique decomposition. We optimize a trade-off between two objectives: we minimize the distance of the data points from the convex envelope of the archetypes (which can be interpreted as an empirical risk), while also minimizing the distance of the archetypes from the convex envelope of the data (which can be interpreted as a data-dependent regularization). The archetypal analysis method of Cutler and Breiman is recovered as the limiting case in which the last term is given infinite weight. We introduce a "uniqueness condition" on the data which is necessary for identifiability. We prove that, under uniqueness (plus additional regularity conditions on the geometry of the archetypes), our estimator is robust. While our approach requires solving a nonconvex optimization problem, we find that standard optimization methods succeed in finding good solutions for both real and synthetic data. for this article are available online
引用
收藏
页码:896 / 907
页数:12
相关论文
共 50 条
  • [21] DSANLS: Accelerating Distributed Nonnegative Matrix Factorization via Sketching
    Qian, Yuqiu
    Tan, Conghui
    Mamoulis, Nikos
    Cheung, David W.
    WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 450 - 458
  • [22] Social Spammer Detection via Convex Nonnegative Matrix Factorization
    Shen, Hua
    Wang, Bangyu
    Liu, Xinyue
    Zhang, Xianchao
    IEEE ACCESS, 2022, 10 : 91192 - 91202
  • [23] Online face recognition algorithm via nonnegative matrix factorization
    Yang H.
    He G.
    Information Technology Journal, 2010, 9 (08) : 1719 - 1724
  • [24] ROBUST VISUAL TRACKING VIA INVERSE NONNEGATIVE MATRIX FACTORIZATION
    Liu, Fanghui
    Zhou, Tao
    Fu, Keren
    Gu, Irene Y. H.
    Yang, Jie
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1491 - 1495
  • [25] Social Spammer Detection via Convex Nonnegative Matrix Factorization
    Shen, Hua
    Wang, Bangyu
    Liu, Xinyue
    Zhang, Xianchao
    IEEE Access, 2022, 10 : 91192 - 91202
  • [26] Soft-Constrained Nonnegative Matrix Factorization via Normalization
    Lan, Long
    Guan, Naiyang
    Zhang, Xiang
    Tao, Dacheng
    Luo, Zhigang
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3025 - 3030
  • [27] Collaborative Unmixing Hyperspectral Imagery via Nonnegative Matrix Factorization
    Salehani, Yaser Esmaeili
    Gazor, Saeed
    IMAGE AND SIGNAL PROCESSING (ICISP 2016), 2016, 9680 : 118 - 126
  • [28] Contrastive Topic Evolution Discovery via Nonnegative Matrix Factorization
    Li, Qianru
    Chen, Bo
    Ma, Songjun
    Fu, Luoyi
    Wang, Xinbing
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [29] Quantized Nonnegative Matrix Factorization
    de Frein, Ruairi
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 377 - 382
  • [30] Quadratic nonnegative matrix factorization
    Yang, Zhirong
    Oja, Erkki
    PATTERN RECOGNITION, 2012, 45 (04) : 1500 - 1510