Supervised multiway factorization

被引:11
|
作者
Lock, Eric F. [1 ]
Li, Gen [2 ]
机构
[1] Univ Minnesota, Div Biostat, Sch Publ Hlth, Minneapolis, MN 55455 USA
[2] Columbia Univ, Mailman Sch Publ Hlth, Dept Biostat, New York, NY 10032 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2018年 / 12卷 / 01期
基金
美国国家卫生研究院;
关键词
Faces in the wild; dimension reduction; latent variables; parafac/candecomp; singular value decomposition; tensors; POPULATION VALUE DECOMPOSITION; TENSOR REGRESSION; FRAMEWORK; SPARSE;
D O I
10.1214/18-EJS1421
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We describe a probabilistic PARAFAC/CANDECOMP (CP) factorization for multiway (i.e., tensor) data that incorporates auxiliary covariates, SupCP. SupCP generalizes the supervised singular value decomposition (SupSVD) for vector-valued observations, to allow for observations that have the form of a matrix or higher-order array. Such data are increasingly encountered in biomedical research and other fields. We use a novel likelihood-based latent variable representation of the CP factorization, in which the latent variables are informed by additional covariates. We give conditions for identifiability, and develop an EM algorithm for simultaneous estimation of all model parameters. SupCP can be used for dimension reduction, capturing latent structures that are more accurate and interpretable due to covariate supervision. Moreover, SupCP specifies a full probability distribution for a multiway data observation with given covariate values, which can be used for predictive modeling. We conduct comprehensive simulations to evaluate the SupCP algorithm. We apply it to a facial image database with facial descriptors (e.g., smiling / not smiling) as covariates, and to a study of amino acid fluorescence. Software is available at https://github.com/lockEF/SupCP.
引用
收藏
页码:1150 / 1180
页数:31
相关论文
共 50 条
  • [41] Loss Factorization, Weakly Supervised Learning and Label Noise Robustness
    Patrini, Giorgio
    Nielsen, Frank
    Nock, Richard
    Carioni, Marcello
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [42] OVERLAPPING SOUND EVENT DETECTION WITH SUPERVISED NONNEGATIVE MATRIX FACTORIZATION
    Bisot, Victor
    Essid, Slim
    Richard, Gael
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 31 - 35
  • [43] FULLY SUPERVISED NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE EXTRACTION
    Austin, Woody
    Anderson, Dylan
    Ghosh, Joydeep
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 5772 - 5775
  • [44] Guided Semi-Supervised Non-Negative Matrix Factorization
    Li, Pengyu
    Tseng, Christine
    Zheng, Yaxuan
    Chew, Joyce A.
    Huang, Longxiu
    Jarman, Benjamin
    Needell, Deanna
    ALGORITHMS, 2022, 15 (05)
  • [45] Robust semi-supervised nonnegative matrix factorization for image clustering
    Peng, Siyuan
    Ser, Wee
    Chen, Badong
    Lin, Zhiping
    PATTERN RECOGNITION, 2021, 111
  • [46] Semi-supervised Subspace Learning via Constrained Matrix Factorization
    Viet Hang Duong
    Manh Quan Bui
    Jia Ching Wang
    2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
  • [47] Protein complex detection based on semi-supervised matrix factorization
    Wu, Junxian
    Lin, Mingxiong
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 8205 - 8208
  • [48] Document classification with unsupervised nonnegative matrix factorization and supervised percetron learning
    Barman, Paresh Chandra
    Lee, Soo-Young
    2007 INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, 2007, : 183 - +
  • [49] Music Signal Separation by Supervised Nonnegative Matrix Factorization with Basis Deformation
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    Kondo, Kazunobu
    Takahashi, Yu
    2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,
  • [50] Weakly supervised nonnegative matrix factorization for user-driven clustering
    Jaegul Choo
    Changhyun Lee
    Chandan K. Reddy
    Haesun Park
    Data Mining and Knowledge Discovery, 2015, 29 : 1598 - 1621