Learning Layer-wise Equivariances Automatically using Gradients

被引:0
|
作者
van der Ouderaa, Tycho F. A. [1 ]
Immer, Alexander [2 ,3 ]
van der Wilk, Mark [1 ,4 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
[2] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
[3] Max Planck Inst Intelligent Syst, Tubingen, Germany
[4] Univ Oxford, Dept Comp Sci, Oxford, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutions encode equivariance symmetries into neural networks leading to better generalisation performance. However, symmetries provide fixed hard constraints on the functions a network can represent, need to be specified in advance, and can not be adapted. Our goal is to allow flexible symmetry constraints that can automatically be learned from data using gradients. Learning symmetry and associated weight connectivity structures from scratch is difficult for two reasons. First, it requires efficient and flexible parameterisations of layer-wise equivariances. Secondly, symmetries act as constraints and are therefore not encouraged by training losses measuring data fit. To overcome these challenges, we improve parameterisations of soft equivariance and learn the amount of equivariance in layers by optimising the marginal likelihood, estimated using differentiable Laplace approximations. The objective balances data fit and model complexity enabling layer-wise symmetry discovery in deep networks. We demonstrate the ability to automatically learn layer-wise equivariances on image classification tasks, achieving equivalent or improved performance over baselines with hard-coded symmetry.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Towards Layer-wise Image Vectorization
    Ma, Xu
    Zhou, Yuqian
    Xu, Xingqian
    Sun, Bin
    Filev, Valerii
    Orlov, Nikita
    Fu, Yun
    Shi, Humphrey
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16293 - 16302
  • [32] Learning Feature Hierarchies: A Layer-Wise Tag-Embedded Approach
    Yuan, Zhaoquan
    Xu, Changsheng
    Sang, Jitao
    Yan, Shuicheng
    Hossain, M. Shamim
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (06) : 816 - 827
  • [33] WAYS OF IMPROVING LAYER-WISE CARBONISATION
    SYSKOV, KI
    RAKHANSK.PD
    COKE & CHEMISTRY USSR, 1970, (07): : 13 - &
  • [34] Comparisons Where It Matters: Using Layer-Wise Regularization to Improve Federated Learning on Heterogeneous Data
    Son, Ha Min
    Kim, Moon Hyun
    Chung, Tai-Myoung
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [35] A Layer-Wise Theoretical Framework for Deep Learning of Convolutional Neural Networks
    Huu-Thiet Nguyen
    Li, Sitan
    Cheah, Chien Chern
    IEEE ACCESS, 2022, 10 : 14270 - 14287
  • [36] Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation
    Chung, Yu-An
    Yang, Shao-Wen
    Lin, Hsuan-Tien
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 108 - 113
  • [37] Fed-LAMB: Layer-wise and Dimension-wise Locally Adaptive Federated Learning
    Karimi, Belhal
    Li, Ping
    Li, Xiaoyun
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1037 - 1046
  • [38] An Automatically Layer-Wise Searching Strategy for Channel Pruning Based on Task-Driven Sparsity Optimization
    Feng, Kai-Yuan
    Fei, Xia
    Gong, Maoguo
    Qin, A. K.
    Li, Hao
    Wu, Yue
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5790 - 5802
  • [39] Personalized Federated Learning with Layer-Wise Feature Transformation via Meta-Learning
    Tu, Jingke
    Huang, Jiaming
    Yang, Lei
    Lin, Wanyu
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [40] Dynamics of rectangular laminated composite plates with selective layer-wise fillering rested on elastic foundation using higher-order layer-wise theory
    Parida, Sarada P.
    Jena, Pankaj C.
    Dash, Rati R.
    JOURNAL OF VIBRATION AND CONTROL, 2023, 29 (23-24) : 5598 - 5615