Image colorization using deep convolutional auto-encoder with multi-skip connections

被引:0
|
作者
Xin Jin
Yide Di
Qian Jiang
Xing Chu
Qing Duan
Shaowen Yao
Wei Zhou
机构
[1] Yunnan University,School of Software
来源
Soft Computing | 2023年 / 27卷
关键词
Auto-encoder; Convolutional neural network; Deep learning; Image processing; Image colorization; Residual neural network;
D O I
暂无
中图分类号
学科分类号
摘要
The colorization of grayscale images is a challenging task in image processing. Recently, deep learning has shown remarkable performance in image colorization. However, the detail loss and color distortion are still serious problem for most existing methods, and some useful features may be lost in the processes of various convolutional layers because of the vanishing gradient problem. Therefore, there is still a considerable space to reach the roof of image colorization. In this work, we propose a deep convolutional auto-encoder with special multi-skip connections for image colorization in YUV color space, and the specific contributions or designs of this work are shown as the following five points. First, a given gray image is used as the Y channel to input a deep learning model to predict U and V channel. Second, the adopted encoder-decoder consists of a main path and two branch paths, and the branch path has two skip connection ways that include one shortcut in each three layers and one shortcut in each six layers. Third, the convolutional kernel size is set as 2*2 that is a special consideration in the path of one shortcut in each six layers. Fourth, a composite loss function is proposed based on the mean square error and gradient that is defined to calculate the errors between the ground truth and the predicted result. Finally, we also discuss the reasonable network parameters, such as the way of shortcut connection, the convolutional kernel size of shortcut connection, and loss function parameters. Experiments on different image datasets show that the proposed image colorization model is effective, and the scores of the PNSR, RMSE, SSIM, and Pearson correlation coefficient are, respectively, to 27.0595, 0.1311, 0.561, and 0.9771.
引用
收藏
页码:3037 / 3052
页数:15
相关论文
共 50 条
  • [41] Deep Stacked Sparse Auto-encoder based on Patches for Image Classification
    Jemel, Intidar
    Hassairi, Salima
    Ejbali, Ridha
    Zaied, Mourad
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [42] Script Selection Using Convolutional Auto-encoder for TTS Speech Corpus
    Shamsi, Meysam
    Lolive, Damien
    Barbot, Nelly
    Chevelu, Jonathan
    SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 423 - 432
  • [43] Deep Auto-encoder Based Multi-task Learning Using Probabilistic Transcriptions
    Das, Amit
    Hasegawa-Johnson, Mark
    Vesely, Karel
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2073 - 2077
  • [44] Deep Learning of Convolutional Auto-encoder for Image Matching and 3D Object Reconstruction in the Infrared Range
    Knyaz, Vladimir A.
    Vygolov, Oleg
    Kniaz, Vladimir V.
    Vizilter, Yury
    Gorbatsevich, Vladimir
    Luhmann, Thomas
    Conen, Niklas
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2155 - 2164
  • [45] Unsupervised deep feature representation using adversarial auto-encoder
    Cai, Jinyu
    Wang, Shiping
    Guo, Wenzhong
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER PHYSICAL SYSTEMS (ICPS 2019), 2019, : 749 - 754
  • [46] Hyperspectral image classification using an extended Auto-Encoder method
    Ghasrodashti, Elham Kordi
    Sharma, Nabin
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 92
  • [47] Speckle Noise Reduction in Ultrasound Images using Denoising Auto-encoder with Skip connection
    Bhute, Suraj
    Mandal, Subhamoy
    Guha, Debashree
    PROCEEDINGS OF THE 2024 IEEE SOUTH ASIAN ULTRASONICS SYMPOSIUM, SAUS 2024, 2024,
  • [48] Binary Coding of Speech Spectrograms Using a Deep Auto-encoder
    Deng, L.
    Seltzer, M.
    Yu, D.
    Acero, A.
    Mohamed, A.
    Hinton, G.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1692 - +
  • [49] HIERARCHICAL FEATURE EXTTRATCTION FOR OBJECT RECOGITION IN COMPLEX SAR IMAGE USING MODIFIED CONVOLUTIONAL AUTO-ENCODER
    Tian, S. R.
    Wang, C.
    Zhang, H.
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 854 - 857
  • [50] A Multi-Site Joint Air Pollution Prediction Model Based on Convolutional Auto-Encoder Deep Learning
    Zhang B.
    Lu Y.-J.
    Qin D.-M.
    Zou G.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (06): : 1410 - 1427