Theoretical Analysis of Inductive Biases in Deep Convolutional Networks

被引:0
|
作者
Wang, Zihao [1 ]
Wu, Lei [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we provide a theoretical analysis of the inductive biases in convolutional neural networks (CNNs). We start by examining the universality of CNNs, i.e., the ability to approximate any continuous functions. We prove that a depth of O(log d) suffices for deep CNNs to achieve this universality, where d in the input dimension. Additionally, we establish that learning sparse functions with CNNs requires only (O) over tilde (log(2) d) samples, indicating that deep CNNs can efficiently capture long-range sparse correlations. These results are made possible through a novel combination of the multichanneling and downsampling when increasing the network depth. We also delve into the distinct roles of weight sharing and locality in CNNs. To this end, we compare the performance of CNNs, locally-connected networks (LCNs), and fully-connected networks (FCNs) on a simple regression task, where LCNs can be viewed as CNNs without weight sharing. On the one hand, we prove that LCNs require Omega(d) samples while CNNs need only (O) over tilde (log(2) d) samples, highlighting the critical role of weight sharing. On the other hand, we prove that FCNs require Omega(d(2)) samples, whereas LCNs need only (O) over tilde (d) samples, underscoring the importance of locality. These provable separations quantify the difference between the two biases, and the major observation behind our proof is that weight sharing and locality break different symmetries in the learning process.
引用
收藏
页数:50
相关论文
共 50 条
  • [41] Wrist Pulse Signals Analysis based on Deep Convolutional Neural Networks
    Hu Xiaojuan
    Zhu Honghai
    Xu Jiatuo
    Xu Dongrong
    Dong Jun
    2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2014,
  • [42] Deep convolutional generative adversarial networks: performance analysis in wireless systems
    Shams Fardous Arnab
    S. M. Ibnul Ul Alam
    Tasfin Mahmud
    Saifur Rahman Sabuj
    Shakil Ahmed
    Discover Internet of Things, 4 (1):
  • [43] Structure-functional analysis and synthesis of deep convolutional neural networks
    Vizilter, Yu. V.
    Gorbatsevich, V. S.
    Zheltov, S. Y.
    COMPUTER OPTICS, 2019, 43 (05) : 886 - 900
  • [44] Deep Convolutional Neural Networks for Breast Cancer Histology Image Analysis
    Rakhlin, Alexander
    Shvets, Alexey
    Iglovikov, Vladimir
    Kalinin, Alexandr A.
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2018), 2018, 10882 : 737 - 744
  • [45] Hybrid deep graph convolutional networks
    Yang, Fei
    Zhang, Huyin
    Tao, Shiming
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) : 2239 - 2255
  • [46] Simple and Deep Graph Convolutional Networks
    Chen, Ming
    Wei, Zhewei
    Huang, Zengfeng
    Ding, Bolin
    Li, Yaliang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [47] Hybrid deep graph convolutional networks
    Fei Yang
    Huyin Zhang
    Shiming Tao
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 2239 - 2255
  • [48] Simple and Deep Graph Convolutional Networks
    Chen, Ming
    Wei, Zhewei
    Huang, Zengfeng
    Ding, Bolin
    Li, Yaliang
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [49] Lipschitz properties for deep convolutional networks
    Balan, Radu
    Singh, Maneesh
    Zou, Dongmian
    FRAMES AND HARMONIC ANALYSIS, 2018, 706 : 129 - 151
  • [50] Universality of deep convolutional neural networks
    Zhou, Ding-Xuan
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2020, 48 (02) : 787 - 794