Bayesian Robust Tensor Factorization for Incomplete Multiway Data

被引:121
|
作者
Zhao, Qibin [1 ,2 ]
Zhou, Guoxu [1 ]
Zhang, Liqing [3 ]
Cichocki, Andrzej [1 ,4 ]
Amari, Shun-Ichi [5 ]
机构
[1] RIKEN, Brain Sci Inst, Lab Adv Brain Signal Proc, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai 200240, Peoples R China
[4] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[5] RIKEN, Brain Sci Inst, Lab Math Neurosci, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
基金
中国国家自然科学基金;
关键词
Rank determination; robust factorization; tensor completion; tensor factorization; variational Bayesian (VB) inference; video background modeling; CANONICAL POLYADIC DECOMPOSITION; RANK; APPROXIMATION; OPTIMIZATION; COMPLETION; ALGORITHMS; IMAGE; PCA;
D O I
10.1109/TNNLS.2015.2423694
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a generative model for robust tensor factorization in the presence of both missing data and outliers. The objective is to explicitly infer the underlying low-CANDECOMP/PARAFAC (CP)-rank tensor capturing the global information and a sparse tensor capturing the local information (also considered as outliers), thus providing the robust predictive distribution over missing entries. The low-CP-rank tensor is modeled by multilinear interactions between multiple latent factors on which the column sparsity is enforced by a hierarchical prior, while the sparse tensor is modeled by a hierarchical view of Student-t distribution that associates an individual hyperparameter with each element independently. For model learning, we develop an efficient variational inference under a fully Bayesian treatment, which can effectively prevent the overfitting problem and scales linearly with data size. In contrast to existing related works, our method can perform model selection automatically and implicitly without the need of tuning parameters. More specifically, it can discover the groundtruth of CP rank and automatically adapt the sparsity inducing priors to various types of outliers. In addition, the tradeoff between the low-rank approximation and the sparse representation can be optimized in the sense of maximum model evidence. The extensive experiments and comparisons with many state-of-the-art algorithms on both synthetic and real-world data sets demonstrate the superiorities of our method from several perspectives.
引用
收藏
页码:736 / 748
页数:13
相关论文
共 50 条
  • [41] Tensor RPCA by Bayesian CP Factorization with Complex Noise
    Luo, Qiong
    Han, Zhi
    Chen, Xi'ai
    Wang, Yao
    Meng, Deyu
    Liang, Dong
    Tang, Yandong
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5029 - 5038
  • [42] Ensemble Bayesian Tensor Factorization for Debond Thermal NDT
    Lu, Peng
    Gao, Bin
    Feng, Qizhi
    Yang, Yang
    Woo, W. L.
    Zhao, Jian
    Qiu, Xueshi
    Gu, Liangyong
    Tian, Guiyun
    PROCEEDINGS OF 2017 IEEE FAR EAST FORUM ON NONDESTRUCTIVE EVALUATION/TESTING: NEW TECHNOLOGY & APPLICATION (IEEE FENDT 2017), 2017, : 228 - 232
  • [43] Robust Asymmetric Bayesian Adaptive Matrix Factorization
    Guo, Xin
    Pan, Boyuan
    Cai, Deng
    He, Xiaofei
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1760 - 1766
  • [44] Robust incomplete factorization based on value and space constraints
    Suarjana, M.
    Law, K.H.
    International Journal for Numerical Methods in Engineering, 1995, 38 (10)
  • [45] A robust incomplete factorization preconditioner for positive definite matrices
    Benzi, M
    Tuma, M
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2003, 10 (5-6) : 385 - 400
  • [46] A ROBUST INCOMPLETE FACTORIZATION BASED ON VALUE AND SPACE CONSTRAINTS
    SUARJANA, M
    LAW, KH
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 1995, 38 (10) : 1703 - 1719
  • [47] Convergence analysis of data augmentation algorithms for Bayesian robust multivariate linear regression with incomplete data
    Li, Haoxiang
    Qin, Qian
    Jones, Galin L.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 202
  • [48] Bayesian nonparametric multiway regression for clustered binomial data
    Lock, Eric F.
    Bandyopadhyay, Dipankar
    STAT, 2021, 10 (01):
  • [49] Provable Tensor Factorization with Missing Data
    Jain, Prateek
    Oh, Sewoong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [50] Structural health monitoring data cleaning based on Bayesian robust tensor learning
    Yi, Letian
    Ding, Youliang
    Hou, Jiale
    Yue, Zixiang
    Zhao, Hanwei
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (04): : 2169 - 2192