Data Symmetries and Learning in Fully Connected Neural Networks

被引:1
|
作者
Anselmi, Fabio [1 ,2 ]
Manzoni, Luca [1 ]
D'onofrio, Alberto [1 ]
Rodriguez, Alex [1 ]
Caravagna, Giulio [1 ]
Bortolussi, Luca [1 ]
Cairoli, Francesca [1 ]
机构
[1] Univ Trieste, Dept Math & Geosci, I-34127 Trieste, Italy
[2] MIT, McGovern Inst, Ctr Brains Minds & Machines, Cambridge, MA 02139 USA
关键词
Orbits; Finite element analysis; Reflection; Task analysis; Complexity theory; Artificial neural networks; Machine learning; symmetry invariance; equivariance; INVARIANT OBJECT RECOGNITION; PATTERN-RECOGNITION; SIZE-INVARIANT; SHIFT;
D O I
10.1109/ACCESS.2023.3274938
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Symmetries in the data and how they constrain the learned weights of modern deep networks is still an open problem. In this work we study the simple case of fully connected shallow non-linear neural networks and consider two types of symmetries: full dataset symmetries where the dataset X is mapped into itself by any transformation g, i.e. gX = X or single data point symmetries where gx = x, x ? X. We prove and experimentally confirm that symmetries in the data are directly inherited at the level of the network's learned weights and relate these findings with the common practice of data augmentation in modern machine learning. Finally, we show how symmetry constraints have a profound impact on the spectrum of the learned weights, an aspect of the so-called network implicit bias.
引用
收藏
页码:47282 / 47290
页数:9
相关论文
共 50 条
  • [31] A novel structured sparse fully connected layer in convolutional neural networks
    Matsumura, Naoki
    Ito, Yasuaki
    Nakano, Koji
    Kasagi, Akihiko
    Tabaru, Tsuguchika
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (11):
  • [32] Parallel dynamics of fully connected Q-Ising neural networks
    Bolle, D
    Jongen, G
    Shim, GM
    JOURNAL OF STATISTICAL PHYSICS, 1998, 91 (1-2) : 125 - 153
  • [33] Thermodynamics of fully connected Blume-Emery-Griffiths neural networks
    Bollé, D
    Verbeiren, T
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2003, 36 (02): : 295 - 305
  • [34] Parallel Dynamics of Fully Connected Q-Ising Neural Networks
    D. Bollé
    G. Jongen
    G. M. Shim
    Journal of Statistical Physics, 1998, 91 : 125 - 153
  • [35] Learning Hamiltonian Systems considering System Symmetries in Neural Networks
    Dierkes, Eva
    Flasskamp, Kathrin
    IFAC PAPERSONLINE, 2021, 54 (19): : 210 - 216
  • [36] Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization
    Jakub, Cameron
    Nica, Mihai
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 45
  • [37] Partial Label Learning Based on Fully Connected Deep Neural Network
    Li H.
    Wu L.
    He J.
    Zheng R.
    Zhou Y.
    Qiao S.
    International Journal of Circuits, Systems and Signal Processing, 2022, 16 : 287 - 297
  • [38] Finding the Capacity of Fuzzy Neural Networks (FNNs) via Its Equivalent Fully Connected Neural Networks (FFNNs)
    Wang, Jing
    Wang, Chi-Hsu
    Chen, C. L. Philip
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 2193 - 2198
  • [39] Connected target coverage for fully data gathering in wireless sensor networks
    Junbin, Liang
    Ming, Liu
    Xiaoyan, Kui
    Sensors and Transducers, 2013, 155 (08): : 74 - 79
  • [40] A Design Flow Framework for Fully-Connected Neural Networks Rapid Prototyping
    Zompakis, Nikolaos
    Anagnostos, Dimitrios
    Koliogeorgi, Konstantina
    Zervakis, Georgios
    Siozios, Kostas
    INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (COINS), 2019, : 44 - 49