Tensor-tensor algebra for optimal representation and compression of multiway data

被引：58

作者：

Kilmer, Misha E. ^{[1
]}

Horesh, Lior ^{[2
]}

Avron, Haim ^{[3
]}

Newman, Elizabeth ^{[4
]}

机构：

[1] Tufts Univ, Dept Math, Medford, MA 02155 USA

[2] IBM Res, Math AI, Yorktown Hts, NY 10598 USA

[3] Tel Aviv Univ, Sch Math Sci, IL-6997801 Tel Aviv, Israel

[4] Emory Univ, Dept Math, Atlanta, GA 30322 USA

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2021年 / 118卷 / 28期

关键词：

tensor  compression  multiway data  SVD  rank; FACTORIZATION;

D O I：

10.1073/pnas.2015851118

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

With the advent of machine learning and its overarching pervasiveness it is imperative to devise ways to represent large datasets efficiently while distilling intrinsic features necessary for subsequent analysis. The primary workhorse used in data dimensionality reduction and feature extraction has been the matrix singular value decomposition (SVD), which presupposes that data have been arranged in matrix format. A primary goal in this study is to show that high-dimensional datasets are more compressible when treated as tensors (i.e., multiway arrays) and compressed via tensor-SVDs under the tensor-tensor product constructs and its generalizations. We begin by proving Eckart-Young optimality results for families of tensor-SVDs under two different truncation strategies. Since such optimality properties can be proven in both matrix and tensor-based algebras, a fundamental question arises: Does the tensor construct subsume the matrix construct in terms of representation efficiency? The answer is positive, as proven by showing that a tensor-tensor representation of an equal dimensional spanning space can be superior to its matrix counterpart. We then use these optimality results to investigate how the compressed representation provided by the truncated tensor SVD is related both theoretically and empirically to its two closest tensor based analogs, the truncated high-order SVD and the truncated tensor-train SVD.

引用

页数：12

共 50 条

[41] Efficient multichannel EEG compression by optimal tensor truncation
Das, Samiran
Kyal, Chirag
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 68
[42] Semantic Data Representation for Improving Tensor Factorization
Nakatsuji, Makoto
Fujiwara, Yasuhiro
Toda, Hiroyuki
Sawada, Hiroshi
Zheng, Jin
Hendler, James A.
PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2004 - 2012
[43] TTHRESH: Tensor Compression for Multidimensional Visual Data
Ballester-Ripoll, Rafael
Lindstrom, Peter
Pajarola, Renato
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (09) : 2891 - 2903
[44] A Tensor Framework for Data Stream Clustering and Compression
Cyganek, Boguslaw
Wozniak, Michal
IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 163 - 173
[45] Multiway clustering via tensor block models
Wang, Miaoyan
Zeng, Yuchen
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[46] Second-order cosmological perturbations. IV. Produced by scalar-tensor and tensor-tensor couplings during the radiation dominated stage
Wang, Bo
Zhang, Yang
PHYSICAL REVIEW D, 2019, 99 (12)
[47] Linear algebra for tensor problems
Oseledets, I. V.
Savostyanov, D. V.
Tyrtyshnikov, E. E.
COMPUTING, 2009, 85 (03) : 169 - 188
[48] Linear algebra for tensor problems
I. V. Oseledets
D. V. Savostyanov
E. E. Tyrtyshnikov
Computing, 2009, 85 : 169 - 188
[49] EXTENSIONS OF THE TENSOR ALGEBRA AND THEIR APPLICATIONS
Itoh, Minoru
COMMUNICATIONS IN ALGEBRA, 2012, 40 (09) : 3442 - 3493
[50] Tensor extension of the Poincare algebra
Soroka, DV
Soroka, VA
PHYSICS LETTERS B, 2005, 607 (3-4) : 302 - 305

← 1 2 3 4 5 →