Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization

Cited by: 0

Authors:

Noh, Hyeonwoo [1]

You, Tackgeun [1]

Mun, Jonghwan [1]

Han, Bohyung [1]

Affiliations:

[1] POSTECH, Dept Comp Sci & Engn, Pohang, South Korea

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017) | 2017 / Vol. 30

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Overfitting is one of the most critical challenges in deep neural networks, and various regularization methods exist to improve generalization performance. Injecting noise into hidden units during training, e.g., dropout, is known to be a successful regularizer, but it is still not clear why such training techniques work well in practice or how their benefit can be maximized in the presence of two conflicting objectives: fitting the true data distribution and preventing overfitting by regularization. This paper addresses these issues by 1) interpreting conventional training with regularization by noise injection as optimizing a lower bound of the true objective and 2) proposing a technique that achieves a tighter lower bound by using multiple noise samples per training example in each stochastic gradient descent iteration. We demonstrate the effectiveness of our idea in several computer vision applications.
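
The lower-bound relationship mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the "true objective" is the log of the likelihood averaged over noise, and it uses a hypothetical toy logistic model with dropout applied to its inputs. The function names, weights, and `keep_prob` value are illustrative assumptions. By Jensen's inequality, the log of the averaged likelihood over several noise samples is never smaller than the average of the per-sample log-likelihoods computed from the same samples.

```python
import math
import random


def likelihood_under_noise(x, y, w, keep_prob, rng):
    """Return p(y | x, noise) for a toy logistic model (hypothetical example).

    Dropout noise is applied to the inputs: each unit is kept with
    probability keep_prob and rescaled by 1 / keep_prob (inverted dropout).
    """
    z = 0.0
    for wi, xi in zip(w, x):
        mask = 1.0 / keep_prob if rng.random() < keep_prob else 0.0
        z += wi * xi * mask
    p1 = 1.0 / (1.0 + math.exp(-z))
    return p1 if y == 1 else 1.0 - p1


def noise_bounds(x, y, w, num_samples, keep_prob=0.5, seed=0):
    """Compare two objectives computed from the same noise samples.

    Returns (single, multi):
      single: average of per-sample log-likelihoods (one-sample-style objective)
      multi:  log of the averaged likelihood (multi-sample objective)
    Jensen's inequality guarantees multi >= single.
    """
    rng = random.Random(seed)
    probs = [likelihood_under_noise(x, y, w, keep_prob, rng)
             for _ in range(num_samples)]
    single = sum(math.log(p) for p in probs) / num_samples
    multi = math.log(sum(probs) / num_samples)
    return single, multi


if __name__ == "__main__":
    x, y, w = [1.0, 0.5, -1.2], 1, [1.5, -2.0, 0.7]  # illustrative values
    for k in (1, 4, 16):
        single, multi = noise_bounds(x, y, w, num_samples=k)
        print(k, round(single, 4), round(multi, 4))
```

With a single noise sample the two quantities coincide. With more samples the second quantity sits at or above the first, which is the sense in which averaging over several noise samples per example can give a tighter bound under the assumption stated above.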


Pages: 10

50 records in total

[31] Understanding and mitigating noise in trained deep neural networks
Semenova, Nadezhda
Larger, Laurent
Brunner, Daniel
NEURAL NETWORKS, 2022, 146 : 151 - 160
[32] Application of Artificial Neural Networks for Noise Barrier Optimization
Trombetta Zannin, Paulo Henrique
do Nascimento, Eriberto Oliveira
da Paz, Elaine Carvalho
do Valle, Felipe
ENVIRONMENTS, 2018, 5 (12): 1 - 20
[33] Regularizing Activation Distribution for Training Binarized Deep Networks
Ding, Ruizhou
Chin, Ting-Wu
Liu, Zeye
Marculescu, Diana
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 11400 - 11409
[34] Regularizing Neural Networks via Retaining Confident Connections
Zhang, Shengnan
Hou, Yuexian
Wang, Benyou
Song, Dawei
ENTROPY, 2017, 19 (07)
[35] Automated interpretation of the coronary angioscopy with deep convolutional neural networks
Miyoshi, Toru
Higaki, Akinori
Kawakami, Hideo
Yamaguchi, Osamu
OPEN HEART, 2020, 7 (01)
[36] Seismic fault interpretation based on deep convolutional neural networks
Chang D.
Yong X.
Wang Y.
Yang W.
Li H.-S.
Zhang G.
Chang, Dekuan (changdk@petrochina.com.cn), 1600, Science Press (56): : 1 - 8
[37] Explaining deep neural networks: A survey on the global interpretation methods
Saleem, Rabia
Yuan, Bo
Kurugollu, Fatih
Anjum, Ashiq
Liu, Lu
NEUROCOMPUTING, 2022, 513 : 165 - 180
[38] Synchrosqueezing voices through deep neural networks for horizon interpretation
AlSalmi, Haifa
Wang, Yanghua
INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2024, 12 (03): SE89 - SE102
[39] Semantics for Global and Local Interpretation of Deep Convolutional Neural Networks
Gu, Jindong
Zhao, Rui
Tresp, Volker
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021
[40] Automatic Seismic Salt Interpretation with Deep Convolutional Neural Networks
Zeng, Yu
Jiang, Kebei
Chen, Jie
PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND DATA MINING (ICISDM 2019), 2019: 16 - 20
