Information-Theoretic GAN Compression with Variational Energy-based Model

被引:0
|
作者
Kang, Minsoo [1 ]
Yoo, Hyewon [2 ]
Kang, Eunhee [3 ]
Ki, Sehwan [3 ]
Lee, Hyong-Euk [3 ]
Han, Bohyung [1 ,2 ]
机构
[1] Seoul Natl Univ, ECE, Seoul, South Korea
[2] Seoul Natl Univ, IPAI, Seoul, South Korea
[3] Samsung Adv Inst Technol SAIT, Suwon, South Korea
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an information-theoretic knowledge distillation approach for the compression of generative adversarial networks, which aims to maximize the mutual information between teacher and student networks via a variational optimization based on an energy-based model. Because the direct computation of the mutual information in continuous domains is intractable, our approach alternatively optimizes the student network by maximizing the variational lower bound of the mutual information. To achieve a tight lower bound, we introduce an energy-based model relying on a deep neural network to represent a flexible variational distribution that deals with high-dimensional images and consider spatial dependencies between pixels, effectively. Since the proposed method is a generic optimization algorithm, it can be conveniently incorporated into arbitrary generative adversarial networks and even dense prediction networks, e.g., image enhancement models. We demonstrate that the proposed algorithm achieves outstanding performance in model compression of generative adversarial networks consistently when combined with several existing models.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Information-Theoretic Occupancy Grid Compression for High-Speed Information-Based Exploration
    Nelson, Erik
    Michael, Nathan
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 4976 - 4982
  • [22] On the Energy-Based Variational Model for Vector Magnetic Hysteresis
    Prigozhin, Leonid
    Sokolovsky, Vladimir
    Barrett, John W.
    Zirka, Sergey E.
    IEEE TRANSACTIONS ON MAGNETICS, 2016, 52 (12)
  • [23] Information-theoretic assessment of imaging systems via data compression
    Aiazzi, B
    Alparone, L
    Baronti, S
    MATHEMATICS OF DATA/IMAGE CODING, COMPRESSION, AND ENCRYPTION IV, WITH APPLICATIONS, 2001, 4475 : 55 - 66
  • [24] Emergence of genetic coding: An information-theoretic model
    Piraveenan, Mahendra
    Polani, Daniel
    Prokopenko, Mikhail
    ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 2007, 4648 : 42 - +
  • [25] AN INFORMATION-THEORETIC MODEL FOR SERIAL POSITION EFFECT
    THOMAS, HBG
    PSYCHOLOGICAL REVIEW, 1968, 75 (05) : 409 - +
  • [26] Information-Theoretic Foundation for the Weighted Updating Model
    Zinn, Jesse Aaron
    REVIEW OF BEHAVIORAL ECONOMICS, 2019, 6 (01): : 39 - 51
  • [27] Information-Theoretic Model Selection for Independent Components
    Plant, Claudia
    Theis, Fabian J.
    Meyer-Baese, Anke
    Boehm, Christian
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 : 254 - +
  • [28] Zero and negative energy dissipation at information-theoretic erasure
    Laszlo Bela Kish
    Claes-Göran Granqvist
    Sunil P. Khatri
    Ferdinand Peper
    Journal of Computational Electronics, 2016, 15 : 335 - 339
  • [29] An information-theoretic instance-based classifier
    Gokcay, Erhan
    INFORMATION SCIENCES, 2020, 536 : 263 - 276
  • [30] Information density converges in dialogue: Towards an information-theoretic model
    Xu, Yang
    Reitter, David
    COGNITION, 2018, 170 : 147 - 163