Alpha matting for portraits using encoder-decoder models

Cited by: 0
Authors
Akshat Srivastava
Srivatsav Raghu
Abitha K Thyagarajan
Jayasri Vaidyaraman
Mohanaprasad Kothandaraman
Pavan Sudheendra
Avinav Goel
Affiliations
[1] VIT University, School of Electronics Engineering (SENSE)
[2] Samsung R&D Institute
Keywords
Alpha matting; Image segmentation; Deep learning; Encoder-decoder models
DOI
Not available
Abstract
Image matting is a technique for separating the foreground of an image from its background. In the past, classical algorithms based on sampling, propagation, or a combination of the two were used to perform image matting; however, most of these produce poor results on images with complex backgrounds. They are also unable to extract, with high accuracy, foregrounds that are composed of thin objects. In this context, the use of deep learning to solve the image matting problem has gained increasing popularity. In this paper, an encoder-decoder model for alpha matting of human portraits using deep learning is proposed. The model comprises two parts. The first is an encoder-decoder stage: a deep convolutional network with 11 convolutional layers and 5 max-pooling layers in the encoder and 11 convolutional layers and 5 unpooling layers in the decoder. This stage takes the image and a trimap as input and produces a coarse alpha matte as output. The second part is a refinement stage with four convolutional layers, which refines the coarse alpha matte produced by the encoder-decoder stage to obtain an alpha matte of high accuracy. The model was trained on 43,100 images. When tested on the alphamatting.com dataset, our model’s output was comparable to the industry standard, yielding an average MSE of 0.023 and an average SAD loss of 66.5.
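For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how MSE and SAD between a predicted and a ground-truth alpha matte are commonly computed, together with the standard compositing relation I = alpha * F + (1 - alpha) * B that an alpha matte encodes. The function names, the trimap code 128 for the unknown region, and the choice to score only unknown pixels are assumptions made for illustration; the abstract does not say how the authors normalized or masked their scores, so the values here are not comparable to the reported 0.023 and 66.5.

```python
import numpy as np

def composite(alpha, fg, bg):
    """Blend a foreground over a background: I = alpha * F + (1 - alpha) * B."""
    a = alpha[..., None]  # add a channel axis so the matte broadcasts over RGB
    return a * fg + (1.0 - a) * bg

def matte_errors(pred, truth, trimap=None, unknown=128):
    """Return (MSE, SAD) between two alpha mattes with values in [0, 1].

    If a trimap is supplied, only pixels equal to `unknown` are scored.
    This masking convention is an assumption, not taken from the paper.
    """
    if trimap is None:
        mask = np.ones(pred.shape, dtype=bool)
    else:
        mask = trimap == unknown
    diff = pred[mask].astype(np.float64) - truth[mask].astype(np.float64)
    mse = float(np.mean(diff ** 2))     # mean squared error
    sad = float(np.sum(np.abs(diff)))   # sum of absolute differences
    return mse, sad

# Toy 2x2 mattes, purely illustrative values.
truth = np.array([[0.0, 0.5], [1.0, 1.0]])
pred = np.array([[0.0, 0.7], [0.9, 1.0]])
mse, sad = matte_errors(pred, truth)
```

MSE averages the squared per-pixel error, so it is independent of image size, whereas SAD sums absolute errors and therefore grows with the number of scored pixels; this is why the two figures in the abstract sit on very different scales.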
Pages: 14517-14528
Number of pages: 11