Theoretical Analysis of Inductive Biases in Deep Convolutional Networks

被引：0

作者：

Wang, Zihao ^{[1
]}

Wu, Lei ^{[1
]}

机构：

[1] Peking Univ, Beijing, Peoples R China

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

基金：

国家重点研发计划;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we provide a theoretical analysis of the inductive biases in convolutional neural networks (CNNs). We start by examining the universality of CNNs, i.e., the ability to approximate any continuous functions. We prove that a depth of O(log d) suffices for deep CNNs to achieve this universality, where d in the input dimension. Additionally, we establish that learning sparse functions with CNNs requires only (O) over tilde (log(2) d) samples, indicating that deep CNNs can efficiently capture long-range sparse correlations. These results are made possible through a novel combination of the multichanneling and downsampling when increasing the network depth. We also delve into the distinct roles of weight sharing and locality in CNNs. To this end, we compare the performance of CNNs, locally-connected networks (LCNs), and fully-connected networks (FCNs) on a simple regression task, where LCNs can be viewed as CNNs without weight sharing. On the one hand, we prove that LCNs require Omega(d) samples while CNNs need only (O) over tilde (log(2) d) samples, highlighting the critical role of weight sharing. On the other hand, we prove that FCNs require Omega(d(2)) samples, whereas LCNs need only (O) over tilde (d) samples, underscoring the importance of locality. These provable separations quantify the difference between the two biases, and the major observation behind our proof is that weight sharing and locality break different symmetries in the learning process.

引用

页数：50

共 50 条

[1] Theoretical Scalability Analysis of Distributed Deep Convolutional Neural Networks
Castello, Adrian
Dolz, Manuel F.
Quintana-Orti, Enrique S.
Duato, Jose
2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 534 - 541
[2] On the inductive biases of deep domain adaptation
Siry, Rodrigue
Hemadou, Louis
Simon, Loic
Jurie, Frederic
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
[3] ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
d'Ascoli, Stephane
Touvron, Hugo
Leavitt, Matthew L.
Morcos, Ari S.
Biroli, Giulio
Sagun, Levent
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[4] ConViT: improving vision transformers with soft convolutional inductive biases
d'Ascoli, Stephane
Touvron, Hugo
Leavitt, Matthew L.
Morcos, Ari S.
Biroli, Giulio
Sagun, Levent
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (11):
[5] Demystifying the Hypercomplex: Inductive biases in hypercomplex deep learning
Comminiello, Danilo
Grassucci, Eleonora
Mandic, Danilo P.
Uncini, Aurelio
IEEE SIGNAL PROCESSING MAGAZINE, 2024, 41 (03) : 59 - 71
[6] Theoretical and empirical evidence for the impact of inductive biases on cultural evolution
Griffiths, Thomas L.
Kalish, Michael L.
Lewandowsky, Stephan
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1509) : 3503 - 3514
[7] Inductive biases for deep learning of higher-level cognition
Goyal, Anirudh
Bengio, Yoshua
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2022, 478 (2266):
[8] Twitter Sentiment Analysis with Deep Convolutional Neural Networks
Severyn, Aliaksei
Moschitti, Alessandro
SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 959 - 962
[9] Towards Better Analysis of Deep Convolutional Neural Networks
Liu, Mengchen
Shi, Jiaxin
Li, Zhen
Li, Chongxuan
Zhu, Jun
Liu, Shixia
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 91 - 100
[10] A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks
Huu-Thiet Nguyen
Li, Sitan
Cheah, Chien Chern
IEEE ACCESS, 2022, 10 : 14270 - 14287

← 1 2 3 4 5 →