Efficient Low-Dimensional Compression of Overparameterized Models

被引:0
|
作者
Kwon, Soo Min [1 ]
Zhang, Zekai [2 ]
Song, Dogyoon [1 ]
Balzano, Laura [1 ]
Qu, Qing [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Tsinghua Univ, Beijing, Peoples R China
关键词
RANK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present a novel approach for compressing overparameterized models, developed through studying their learning dynamics. We observe that for many deep models, updates to the weight matrices occur within a low-dimensional invariant subspace. For deep linear models, we demonstrate that their principal components are fitted incrementally within a small subspace, and use these insights to propose a compression algorithm for deep linear networks that involve decreasing the width of their intermediate layers. We empirically evaluate the effectiveness of our compression technique on matrix recovery problems. Remarkably, by using an initialization that exploits the structure of the problem, we observe that our compressed network converges faster than the original network, consistently yielding smaller recovery errors. We substantiate this observation by developing a theory focused on deep matrix factorization. Finally, we empirically demonstrate how our compressed model has the potential to improve the utility of deep nonlinear models. Overall, our algorithm improves the training efficiency by more than 2x, without compromising generalization.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Equilibration in low-dimensional quantum matrix models
    R. Hübener
    Y. Sekino
    J. Eisert
    Journal of High Energy Physics, 2015
  • [22] Low-dimensional models of coherent structures in turbulence
    Holmes, PJ
    Lumley, JL
    Berkooz, G
    Mattingly, JC
    Wittenberg, RW
    PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 1997, 287 (04): : 337 - 384
  • [23] On low-dimensional Galerkin models for fluid flow
    Rempfer, D
    THEORETICAL AND COMPUTATIONAL FLUID DYNAMICS, 2000, 14 (02) : 75 - 88
  • [24] A Low-Dimensional Function Space for Efficient Spectral Upsampling
    Jakob, Wenzel
    Hanika, Johannes
    COMPUTER GRAPHICS FORUM, 2019, 38 (02) : 147 - 155
  • [25] Low-dimensional models of thin film fluid dynamics
    Physics Letters. Section A: General, Atomic and Solid State Physics, 1996, 212 (1-2):
  • [26] Direct estimation of low-dimensional components in additive models
    Fan, JQ
    Härdle, W
    Mammen, E
    ANNALS OF STATISTICS, 1998, 26 (03): : 943 - 971
  • [27] DROPLET THEORY OF LOW-DIMENSIONAL ISING-MODELS
    BRUCE, AD
    WALLACE, DJ
    PHYSICAL REVIEW LETTERS, 1981, 47 (24) : 1743 - 1746
  • [28] Phase Transitions in Low-Dimensional Disordered Potts Models
    Babaev, A. B.
    Murtazaev, A. K.
    PHYSICS OF THE SOLID STATE, 2020, 62 (05) : 851 - 855
  • [29] Analysis of a Class of Low-Dimensional Models of Mutation and Predation
    Abernethy, Gavin M.
    McCartney, Mark
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2016, 26 (11):
  • [30] Low-dimensional models from upper bound theory
    Chini, Gregory P.
    Dianati, Navid
    Zhang, Zhexuan
    Doering, Charles R.
    PHYSICA D-NONLINEAR PHENOMENA, 2011, 240 (02) : 241 - 248