Embedding Principle in Depth for the Loss Landscape Analysis of Deep Neural Networks

被引:0
|
作者
Bai, Zhiwei [1 ]
Luo, Tao [1 ,2 ]
Xu, Zhi-Qin John [1 ]
Zhang, Yaoyu [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Math Sci, Inst Nat Sci, Shanghai 200240, Peoples R China
[2] CMA Shanghai, Shanghai Artificial Intelligence Lab, Shanghai 200240, Peoples R China
[3] Shanghai Ctr Brain Sci & Brain Inspired Technol, Shanghai 200240, Peoples R China
来源
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Deep learning; loss landscape; embedding principle;
D O I
10.4208/csiam-am.SO-2023-0020
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this work, we delve into the relationship between deep and shallow neural networks (NNs), focusing on the critical points of their loss landscapes. We discover an embedding principle in depth that loss landscape of an NN "contains" all critical points of the loss landscapes for shallower NNs. The key tool for our discovery is the critical lifting that maps any critical point of a network to critical manifolds of any deeper network while preserving the outputs. To investigate the practical implications of this principle, we conduct a series of numerical experiments. The results confirm that deep networks do encounter these lifted critical points during training, leading to similar training dynamics across varying network depths. We provide theoretical and empirical evidence that through the lifting operation, the lifted critical points exhibit increased degeneracy. This principle also provides insights into the optimization benefits of batch normalization and larger datasets, and enables practical applications like network layer pruning. Overall, our discovery of the embedding principle in depth uncovers the depth-wise hierarchical structure of deep learning loss landscape, which serves as a solid foundation for the further study about the role of depth for DNNs.
引用
收藏
页码:350 / 389
页数:40
相关论文
共 50 条
  • [31] An adaptive embedding procedure for time series forecasting with deep neural networks
    Succetti, Federico
    Rosato, Antonello
    Panella, Massimo
    NEURAL NETWORKS, 2023, 167 : 715 - 729
  • [32] Location Embedding and Deep Convolutional Neural Networks for Next Location Prediction
    Sassi, Abdessamed
    Brahimi, Mohammed
    Bechkit, Walid
    Bachir, Abdelmalik
    2019 IEEE 44TH LOCAL COMPUTER NETWORKS (LCN) SYMPOSIUM ON EMERGING TOPICS IN NETWORKING (LCN SYMPOSIUM 2019), 2019, : 149 - 157
  • [33] Efficient Training of Deep Convolutional Neural Networks by Augmentation in Embedding Space
    Abrishami, Mohammad Saeed
    Eshratifar, Amir Erfan
    Eigen, David
    Wang, Yanzhi
    Nazarian, Shahin
    Pedram, Massoud
    PROCEEDINGS OF THE TWENTYFIRST INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2020), 2020, : 347 - 351
  • [34] Deep Topological Embedding with Convolutional Neural Networks for Complex Network Classification
    Scabini, Leonardo
    Ribas, Lucas
    Ribeiro, Eraldo
    Bruno, Odemir
    NETWORK SCIENCE (NETSCI-X 2022), 2022, 13197 : 54 - 66
  • [35] Local depth edge detection in humans and deep neural networks
    Ehinger, Krista A.
    Adams, Wendy J.
    Graf, Erich W.
    Elder, James H.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2681 - 2689
  • [36] An In-depth Comparison of Compilers for Deep Neural Networks on Hardware
    Xing, Yu
    Weng, Jian
    Wang, Yushun
    Sui, Lingzhi
    Shan, Yi
    Wang, Yu
    2019 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2019,
  • [37] Curriculum Learning for Depth Estimation with Deep Convolutional Neural Networks
    Surendranath, Ajay
    Jayagopi, Dinesh Babu
    PROCEEDINGS OF THE 2ND MEDITERRANEAN CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (MEDPRAI-2018), 2018, : 95 - 100
  • [38] Fast Depth Reconstruction Using Deep Convolutional Neural Networks
    Maslov, Dmitrii
    Makarov, Ilya
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 456 - 467
  • [39] On the turnpike to design of deep neural networks: Explicit depth bounds
    Faulwasser, Timm
    Hempel, Arne-Jens
    Streif, Stefan
    IFAC JOURNAL OF SYSTEMS AND CONTROL, 2024, 30
  • [40] Can Unstructured Pruning Reduce the Depth in Deep Neural Networks?
    Liao, Zhu
    Quetu, Victor
    Nguyen, Van-Tam
    Tartaglione, Enzo
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1394 - 1398