Reviewing Evolution of Learning Functions and Semantic Information Measures for Understanding Deep Learning

被引:2
|
作者
Lu, Chenguang [1 ,2 ]
机构
[1] Liaoning Tech Univ, Intelligence Engn & Math Inst, Fuxin 123000, Peoples R China
[2] Changsha Univ, Sch Comp Engn & Appl Math, Changsha 410022, Peoples R China
关键词
deep learning; learning function; semantic information measure; estimated mutual information; maximum mutual information; generalized entropy; similarity function; SoftMax function; Restricted Boltzmann Machine; SIMILARITY; ALGORITHM;
D O I
10.3390/e25050802
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A new trend in deep learning, represented by Mutual Information Neural Estimation (MINE) and Information Noise Contrast Estimation (InfoNCE), is emerging. In this trend, similarity functions and Estimated Mutual Information (EMI) are used as learning and objective functions. Coincidentally, EMI is essentially the same as Semantic Mutual Information (SeMI) proposed by the author 30 years ago. This paper first reviews the evolutionary histories of semantic information measures and learning functions. Then, it briefly introduces the author's semantic information G theory with the rate-fidelity function R(G) (G denotes SeMI, and R(G) extends R(D)) and its applications to multi-label learning, the maximum Mutual Information (MI) classification, and mixture models. Then it discusses how we should understand the relationship between SeMI and Shannon's MI, two generalized entropies (fuzzy entropy and coverage entropy), Autoencoders, Gibbs distributions, and partition functions from the perspective of the R(G) function or the G theory. An important conclusion is that mixture models and Restricted Boltzmann Machines converge because SeMI is maximized, and Shannon's MI is minimized, making information efficiency G/R close to 1. A potential opportunity is to simplify deep learning by using Gaussian channel mixture models for pre-training deep neural networks' latent layers without considering gradients. It also discusses how the SeMI measure is used as the reward function (reflecting purposiveness) for reinforcement learning. The G theory helps interpret deep learning but is far from enough. Combining semantic information theory and deep learning will accelerate their development.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Gradient estimation of information measures in deep learning
    Wen, Liangjian
    Bai, Haoli
    He, Lirong
    Zhou, Yiji
    Zhou, Mingyuan
    Xu, Zenglin
    KNOWLEDGE-BASED SYSTEMS, 2021, 224
  • [2] Understanding the Evolution of Linear Regions in Deep Reinforcement Learning
    Cohan, Setareh
    Kim, Nam Hee
    Rolnick, David
    van de Panne, Michiel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Organizational learning and the learning organization Reviewing evolution for prospecting the future
    Rebelo, Teresa Manuela
    Gomes, Adelino Duarte
    LEARNING ORGANIZATION, 2008, 15 (04): : 294 - +
  • [4] Building Semantic Understanding beyond Deep Learning from Sound and Vision
    de Souza, Fillipe D. M.
    Sarkar, Sudeep
    Camara-Chavez, Guillermo
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2097 - 2102
  • [5] Understanding Deep Learning
    Wang, Ge
    NATURE MACHINE INTELLIGENCE, 2024, 6 (05) : 502 - 503
  • [6] Semantic Structure in Deep Learning
    Pavlick, Ellie
    ANNUAL REVIEW OF LINGUISTICS, 2022, 8 : 447 - 471
  • [7] Semantic Adversarial Deep Learning
    Seshia, Sanjit A.
    Jha, Somesh
    Dreossi, Tommaso
    IEEE DESIGN & TEST, 2020, 37 (02) : 8 - 18
  • [8] Semantic Adversarial Deep Learning
    Dreossi, Tommaso
    Jha, Somesh
    Seshia, Sanjit A.
    COMPUTER AIDED VERIFICATION (CAV 2018), PT I, 2018, 10981 : 3 - 26
  • [9] Semantic Information in Contrastive Learning
    Quan, Shengjiang
    Hirano, Masahiro
    Yamakawa, Yuji
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5663 - 5673
  • [10] DEEP LEARNING BASED CLASSIFICATION USING SEMANTIC INFORMATION FOR POLSAR IMAGE
    Zhang, Lu
    Xie, Wen
    Zhao, Feng
    Liu, Hanqiang
    Duan, Yiping
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 196 - 199