Theoretical Analysis of Inductive Biases in Deep Convolutional Networks

被引:0
|
作者
Wang, Zihao [1 ]
Wu, Lei [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we provide a theoretical analysis of the inductive biases in convolutional neural networks (CNNs). We start by examining the universality of CNNs, i.e., the ability to approximate any continuous functions. We prove that a depth of O(log d) suffices for deep CNNs to achieve this universality, where d in the input dimension. Additionally, we establish that learning sparse functions with CNNs requires only (O) over tilde (log(2) d) samples, indicating that deep CNNs can efficiently capture long-range sparse correlations. These results are made possible through a novel combination of the multichanneling and downsampling when increasing the network depth. We also delve into the distinct roles of weight sharing and locality in CNNs. To this end, we compare the performance of CNNs, locally-connected networks (LCNs), and fully-connected networks (FCNs) on a simple regression task, where LCNs can be viewed as CNNs without weight sharing. On the one hand, we prove that LCNs require Omega(d) samples while CNNs need only (O) over tilde (log(2) d) samples, highlighting the critical role of weight sharing. On the other hand, we prove that FCNs require Omega(d(2)) samples, whereas LCNs need only (O) over tilde (d) samples, underscoring the importance of locality. These provable separations quantify the difference between the two biases, and the major observation behind our proof is that weight sharing and locality break different symmetries in the learning process.
引用
收藏
页数:50
相关论文
共 50 条
  • [1] Theoretical Scalability Analysis of Distributed Deep Convolutional Neural Networks
    Castello, Adrian
    Dolz, Manuel F.
    Quintana-Orti, Enrique S.
    Duato, Jose
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 534 - 541
  • [2] On the inductive biases of deep domain adaptation
    Siry, Rodrigue
    Hemadou, Louis
    Simon, Loic
    Jurie, Frederic
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
  • [3] ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
    d'Ascoli, Stephane
    Touvron, Hugo
    Leavitt, Matthew L.
    Morcos, Ari S.
    Biroli, Giulio
    Sagun, Levent
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [4] ConViT: improving vision transformers with soft convolutional inductive biases
    d'Ascoli, Stephane
    Touvron, Hugo
    Leavitt, Matthew L.
    Morcos, Ari S.
    Biroli, Giulio
    Sagun, Levent
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (11):
  • [5] Demystifying the Hypercomplex: Inductive biases in hypercomplex deep learning
    Comminiello, Danilo
    Grassucci, Eleonora
    Mandic, Danilo P.
    Uncini, Aurelio
    IEEE SIGNAL PROCESSING MAGAZINE, 2024, 41 (03) : 59 - 71
  • [6] Theoretical and empirical evidence for the impact of inductive biases on cultural evolution
    Griffiths, Thomas L.
    Kalish, Michael L.
    Lewandowsky, Stephan
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1509) : 3503 - 3514
  • [7] Inductive biases for deep learning of higher-level cognition
    Goyal, Anirudh
    Bengio, Yoshua
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2022, 478 (2266):
  • [8] Twitter Sentiment Analysis with Deep Convolutional Neural Networks
    Severyn, Aliaksei
    Moschitti, Alessandro
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 959 - 962
  • [9] Towards Better Analysis of Deep Convolutional Neural Networks
    Liu, Mengchen
    Shi, Jiaxin
    Li, Zhen
    Li, Chongxuan
    Zhu, Jun
    Liu, Shixia
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 91 - 100
  • [10] A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks
    Huu-Thiet Nguyen
    Li, Sitan
    Cheah, Chien Chern
    IEEE ACCESS, 2022, 10 : 14270 - 14287