Tensors for Data Mining and Data Fusion: Models, Applications, and Scalable Algorithms

Cited by: 256
Authors
Papalexakis, Evangelos E. [1]
Faloutsos, Christos [2]
Sidiropoulos, Nicholas D. [3, 4]
Affiliations
[1] Univ Calif Riverside, Dept Comp Sci & Engn, 355 Winston Chung Hall, Riverside, CA 92521 USA
[2] Carnegie Mellon Univ, Dept Comp Sci, GHC 8019,5000 Forbes Ave, Pittsburgh, PA 15213 USA
[3] Univ Minnesota, Dept Elect & Comp Engn, 200 Union St SE, Minneapolis, MN 55455 USA
[4] Univ Minnesota, Dept ECE, Digital Technol Ctr, 200 Union St SE, Minneapolis, MN 55455 USA
Funding
U.S. National Science Foundation
Keywords
Tensors; tensor decomposition; tensor factorization; multi-aspect data; multi-way analysis; LEAST-SQUARES ALGORITHM; MULTILINEAR DECOMPOSITION; LINK PREDICTION; PARAFAC; MATRIX; UNIQUENESS; FACTORIZATION; COMPONENTS; RANK; CANDECOMP/PARAFAC;
DOI
10.1145/2915921
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Tensors and tensor decompositions are very powerful and versatile tools that can model a wide variety of heterogeneous, multiaspect data. As a result, tensor decompositions, which extract useful latent information out of multiaspect data tensors, have witnessed increasing popularity and adoption by the data mining community. In this survey, we present some of the most widely used tensor decompositions, providing the key insights behind them, and summarizing them from a practitioner's point of view. We then provide an overview of a very broad spectrum of applications where tensors have been instrumental in achieving state-of-the-art performance, ranging from social network analysis to brain data analysis, and from web mining to healthcare. Subsequently, we present recent algorithmic advances in scaling tensor decompositions up to today's big data, outlining the existing systems and summarizing the key ideas behind them. Finally, we conclude with a list of challenges and open problems that outline exciting future research directions.
Pages: 44