Wide Hidden Expansion Layer for Deep Convolutional Neural Networks

被引：0

作者：

Wang, Min ^{[1
]}

Liu, Baoyuan ^{[2
]}

Foroosh, Hassan ^{[1
]}

机构：

[1] Univ Cent Florida, Orlando, FL 32816 USA

[2] Amazon, Seattle, WA USA

来源：

2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2020年

关键词：

D O I：

10.1109/wacv45572.2020.9093436

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Non-linearity is an essential factor contributing to the success of deep convolutional neural networks. Increasing the non-linearity in the network will enhance the network's learning capability, attributing to better performance. We present a novel Wide Hidden Expansion (WHE) layer that can significantly increase (by an order of magnitude) the number of activation functions in the network, with very little increase of computational complexity and memory consumption. It can be flexibly embedded with different network architectures to boost the performance of the original networks. The WHE layer is composed of a wide hidden layer, in which each channel only connects with two input channels and one output channel. Before connecting to the output channel, each intermediate channel in the WHE layer is followed by one activation function. In this manner, the number of activation functions can grow along with the number of channels in the hidden layer. We apply the WHE layer to ResNet, WideResNet, SENet, and MobileNet architectures and evaluate on ImageNet, CIFAR-100, and Tiny ImageNet dataset. On the ImageNet dataset, models with the WHE layer can achieve up to 2.01% higher Top-1 accuracy than baseline models, with less than 4% computation increase and less than 2% more parameters. On CIFAR-100 and Tiny ImageNet, when applying the WHE layer to ResNet models, it demonstrates consistent improvement in the accuracy of the networks. Applying the WHE layer to ResNet backbone of the CenterNet object detection model can also boost its performance on COCO and Pascal VOC datasets.

引用

页码：923 / 931

页数：9

共 50 条

[21] DEEP CONVOLUTIONAL NEURAL NETWORKS FOR LVCSR
Sainath, Tara N.
Mohamed, Abdel-rahman
Kingsbury, Brian
Ramabhadran, Bhuvana
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8614 - 8618
[22] Universality of deep convolutional neural networks
Zhou, Ding-Xuan
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2020, 48 (02) : 787 - 794
[23] A Review on Deep Convolutional Neural Networks
Aloysius, Neena
Geetha, M.
2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2017, : 588 - 592
[24] Convergence of deep convolutional neural networks
Xu, Yuesheng
Zhang, Haizhang
NEURAL NETWORKS, 2022, 153 : 553 - 563
[25] Spatial deep convolutional neural networks
Wang, Qi
Parker, Paul A.
Lund, Robert
SPATIAL STATISTICS, 2025, 66
[26] Fusion of Deep Convolutional Neural Networks
Suchy, Robert
Ezekiel, Soundararajan
Cornacchia, Maria
2017 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2017,
[27] Exploring Hidden Dimensions in Parallelizing Convolutional Neural Networks
Jia, Zhihao
Lin, Sina
Qi, Charles R.
Aiken, Alex
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[28] Detection of the Information Hidden in Image by Convolutional Neural Networks
Zubov, Ilya G.
Lysenko, Nikolai V.
Labkov, Gleb M.
PROCEEDINGS OF THE 2019 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (EICONRUS), 2019, : 393 - 394
[29] Neural Networks with Marginalized Corrupted Hidden Layer
Li, Yanjun
Xin, Xin
Guo, Ping
NEURAL INFORMATION PROCESSING, PT III, 2015, 9491 : 506 - 514
[30] GABOR BINARY LAYER IN CONVOLUTIONAL NEURAL NETWORKS
Jiang, Chenzhi
Su, Jianbo
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3408 - 3412

← 1 2 3 4 5 →