Image colorization using deep convolutional auto-encoder with multi-skip connections

Cited by: 0

Authors:

Xin Jin

Yide Di

Qian Jiang

Xing Chu

Qing Duan

Shaowen Yao

Wei Zhou

Affiliation:

[1] Yunnan University, School of Software

Source:

Soft Computing | 2023 / Vol. 27

Keywords:

Auto-encoder; Convolutional neural network; Deep learning; Image processing; Image colorization; Residual neural network

DOI:

Not available

Abstract:

Colorizing grayscale images is a challenging task in image processing. Deep learning has recently shown remarkable performance in image colorization. However, detail loss and color distortion remain serious problems for most existing methods, and some useful features may be lost as they pass through successive convolutional layers because of the vanishing gradient problem. There is therefore still considerable room for improvement in image colorization. In this work, we propose a deep convolutional auto-encoder with special multi-skip connections for image colorization in the YUV color space. The specific contributions and design choices are the following five points. First, a given gray image is used as the Y channel and fed into a deep learning model that predicts the U and V channels. Second, the encoder-decoder consists of a main path and two branch paths, and the branch paths use two kinds of skip connection: one shortcut every three layers and one shortcut every six layers. Third, the convolutional kernel size is set to 2×2 in the path with one shortcut every six layers, which is a special design consideration. Fourth, a composite loss function based on the mean square error and the gradient is proposed to measure the error between the ground truth and the predicted result. Finally, we discuss reasonable network parameters, such as the type of shortcut connection, the convolutional kernel size of the shortcut connection, and the loss function parameters. Experiments on different image datasets show that the proposed image colorization model is effective, with PSNR, RMSE, SSIM, and Pearson correlation coefficient scores of 27.0595, 0.1311, 0.561, and 0.9771, respectively.
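
The abstract names a composite loss built from the mean square error and the image gradient, but it does not give the formula. The sketch below is one plausible reading, not the authors' definition: the forward-difference gradient, the weight `alpha`, and the function names are all assumptions made for illustration. A small PSNR helper is included because PSNR is one of the reported metrics.

```python
import numpy as np


def image_gradients(img):
    """Forward differences along height and width of an (H, W, C) array."""
    dy = img[1:, :, :] - img[:-1, :, :]
    dx = img[:, 1:, :] - img[:, :-1, :]
    return dy, dx


def composite_loss(pred_uv, true_uv, alpha=0.5):
    """MSE plus a weighted gradient-difference term.

    `alpha` is a hypothetical weight; the abstract only says that the
    loss function parameters are discussed in the paper.
    """
    mse = np.mean((pred_uv - true_uv) ** 2)
    pred_dy, pred_dx = image_gradients(pred_uv)
    true_dy, true_dx = image_gradients(true_uv)
    grad = np.mean((pred_dy - true_dy) ** 2) + np.mean((pred_dx - true_dx) ** 2)
    return mse + alpha * grad


def psnr(pred, true, max_val=1.0):
    """Peak signal-to-noise ratio in dB for values in [0, max_val]."""
    mse = np.mean((pred - true) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(max_val ** 2 / mse)


# Toy U/V planes: a constant offset changes the MSE but not the gradients.
true_uv = np.zeros((4, 4, 2))
pred_uv = true_uv + 0.1
loss_value = composite_loss(pred_uv, true_uv)
psnr_value = psnr(pred_uv, true_uv)
```

In this toy case the gradient term vanishes because the offset is uniform, so the loss reduces to the MSE. A prediction with blurred or shifted edges would be penalized by the gradient term as well, which is the usual motivation for adding it when detail loss is a concern.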

Pages: 3037-3052

Page count: 15
