Neural (Tangent Kernel) Collapse

Cited by: 0
Authors
Seleznova, Mariia [1]
Weitzner, Dana [2]
Giryes, Raja [2]
Kutyniok, Gitta [1]
Chou, Hung-Hsu [1]
Affiliations
[1] Ludwig Maximilians Univ Munchen, Munich, Germany
[2] Tel Aviv Univ, Tel Aviv, Israel
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work bridges two important concepts: the Neural Tangent Kernel (NTK), which captures the evolution of deep neural networks (DNNs) during training, and the Neural Collapse (NC) phenomenon, which refers to the emergence of symmetry and structure in the last-layer features of well-trained classification DNNs. We adopt the natural assumption that the empirical NTK develops a block structure aligned with the class labels, i.e., samples within the same class have stronger correlations than samples from different classes. Under this assumption, we derive the dynamics of DNNs trained with mean squared error (MSE) loss and break them into interpretable phases. Moreover, we identify an invariant that captures the essence of the dynamics, and use it to prove the emergence of NC in DNNs with block-structured NTK. We provide large-scale numerical experiments on three common DNN architectures and three benchmark datasets to support our theory.
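The block-structure assumption in the abstract can be pictured with a small sketch. This is not the authors' code: the function names `block_kernel` and `block_means` and the kernel values 1.0, 0.6, and 0.1 are hypothetical, and the kernel here is an idealized scalar-valued matrix rather than an empirical NTK computed from a network. It only shows what "samples within the same class have stronger correlations than samples from different classes" means for a kernel matrix.

```python
import numpy as np

def block_kernel(labels, diag=1.0, within=0.6, between=0.1):
    """Idealized label-aligned kernel (all three values are hypothetical).

    `diag` sits on the diagonal, `within` on distinct same-class pairs,
    and `between` on pairs from different classes.
    """
    labels = np.asarray(labels)
    same = labels[:, None] == labels[None, :]
    K = np.where(same, within, between).astype(float)
    np.fill_diagonal(K, diag)
    return K

def block_means(K, labels):
    """Mean kernel value over same-class (off-diagonal) and cross-class pairs."""
    labels = np.asarray(labels)
    same = labels[:, None] == labels[None, :]
    off_diag = ~np.eye(len(labels), dtype=bool)
    within_mean = K[same & off_diag].mean()
    between_mean = K[~same].mean()
    return within_mean, between_mean

# Three classes with three samples each, sorted by class so the blocks are visible.
labels = [0, 0, 0, 1, 1, 1, 2, 2, 2]
K = block_kernel(labels)
within_mean, between_mean = block_means(K, labels)
print(within_mean > between_mean)
```

The same `block_means` check could, in principle, be applied to a kernel matrix measured from a trained network to see how closely it follows the assumed structure.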
Pages: 31
Related papers
50 in total
  • [21] Analyzing Finite Neural Networks: Can We Trust Neural Tangent Kernel Theory?
    Seleznova, Mariia
    Kutyniok, Gitta
    MATHEMATICAL AND SCIENTIFIC MACHINE LEARNING, VOL 145, 2021, 145 : 868 - 895
  • [22] Quantum-classical hybrid neural networks in the neural tangent kernel regime
    Nakaji, Kouhei
    Tezuka, Hiroyuki
    Yamamoto, Naoki
    QUANTUM SCIENCE AND TECHNOLOGY, 2024, 9 (01)
  • [23] Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels
    Du, Simon S.
    Hou, Kangcheng
    Poczos, Barnabas
    Salakhutdinov, Ruslan
    Wang, Ruosong
    Xu, Keyulu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [24] The Quantum Path Kernel: A Generalized Neural Tangent Kernel for Deep Quantum Machine Learning
    Incudini M.
    Grossi M.
    Mandarino A.
    Vallecorsa S.
    Pierro A.D.
    Windridge D.
    IEEE Transactions on Quantum Engineering, 2023, 4
  • [25] Fast Graph Neural Tangent Kernel via Kronecker Sketching
    Jiang, Shunhua
    Man, Yunze
    Song, Zhao
    Yu, Zheng
    Zhuo, Danyang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022 : 7033 - 7041
  • [26] Spectral Analysis of the Neural Tangent Kernel for Deep Residual Networks
    Belfer, Yuval
    Geifman, Amnon
    Galun, Meirav
    Basri, Ronen
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 49
  • [27] Multi-Angle Fast Neural Tangent Kernel Classifier
    Zhai, Yuejing
    Li, Zhouzheng
    Liu, Haizhong
    APPLIED SCIENCES-BASEL, 2022, 12 (21)
  • [28] Deep learning in random neural fields: Numerical experiments via neural tangent kernel
    Watanabe, Kaito
    Sakamoto, Kotaro
    Karakida, Ryo
    Sonoda, Sho
    Amari, Shun-ichi
    NEURAL NETWORKS, 2023, 160 : 148 - 163
  • [29] Stability & Generalisation of Gradient Descent for Shallow Neural Networks without the Neural Tangent Kernel
    Richards, Dominic
    Kuzborskij, Ilja
    Advances in Neural Information Processing Systems, 2021, 11 : 8609 - 8621
  • [30] Stability & Generalisation of Gradient Descent for Shallow Neural Networks without the Neural Tangent Kernel
    Richards, Dominic
    Kuzborskij, Ilja
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021