Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization

被引:0
|
作者
Noh, Hyeonwoo [1 ]
You, Tackgeun [1 ]
Mun, Jonghwan [1 ]
Han, Bohyung [1 ]
机构
[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Overfitting is one of the most critical challenges in deep neural networks, and there are various types of regularization methods to improve generalization performance. Injecting noises to hidden units during training, e.g., dropout, is known as a successful regularizer, but it is still not clear enough why such training techniques work well in practice and how we can maximize their benefit in the presence of two conflicting objectives-optimizing to true data distribution and preventing overfitting by regularization. This paper addresses the above issues by 1) interpreting that the conventional training methods with regularization by noise injection optimize the lower bound of the true objective and 2) proposing a technique to achieve a tighter lower bound using multiple noise samples per training example in a stochastic gradient descent iteration. We demonstrate the effectiveness of our idea in several computer vision applications.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Understanding and mitigating noise in trained deep neural networks
    Semenova, Nadezhda
    Larger, Laurent
    Brunner, Daniel
    NEURAL NETWORKS, 2022, 146 : 151 - 160
  • [32] Application of Artificial Neural Networks for Noise Barrier Optimization
    Trombetta Zannin, Paulo Henrique
    do Nascimento, Eriberto Oliveira
    da Paz, Elaine Carvalho
    do Valle, Felipe
    ENVIRONMENTS, 2018, 5 (12): : 1 - 20
  • [33] Regularizing Activation Distribution for Training Binarized Deep Networks
    Ding, Ruizhou
    Chin, Ting-Wu
    Liu, Zeye
    Marculescu, Diana
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11400 - 11409
  • [34] Regularizing Neural Networks via Retaining Confident Connections
    Zhang, Shengnan
    Hou, Yuexian
    Wang, Benyou
    Song, Dawei
    ENTROPY, 2017, 19 (07)
  • [35] Automated interpretation of the coronary angioscopy with deep convolutional neural networks
    Miyoshi, Toru
    Higaki, Akinori
    Kawakami, Hideo
    Yamaguchi, Osamu
    OPEN HEART, 2020, 7 (01):
  • [36] Seismic fault interpretation based on deep convolutional neural networks
    Chang D.
    Yong X.
    Wang Y.
    Yang W.
    Li H.-S.
    Zhang G.
    Chang, Dekuan (changdk@petrochina.com.cn), 1600, Science Press (56): : 1 - 8
  • [37] Explaining deep neural networks: A survey on the global interpretation methods
    Saleem, Rabia
    Yuan, Bo
    Kurugollu, Fatih
    Anjum, Ashiq
    Liu, Lu
    NEUROCOMPUTING, 2022, 513 : 165 - 180
  • [38] Synchrosqueezing voices through deep neural networks for horizon interpretation
    AlSalmi, Haifa
    Wang, Yanghua
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2024, 12 (03): : SE89 - SE102
  • [39] Semantics for Global and Local Interpretation of Deep Convolutional Neural Networks
    Gu, Jindong
    Zhao, Rui
    Tresp, Volker
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [40] Automatic Seismic Salt Interpretation with Deep Convolutional Neural Networks
    Zeng, Yu
    Jiang, Kebei
    Chen, Jie
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND DATA MINING (ICISDM 2019), 2019, : 16 - 20