Alpha matting for portraits using encoder-decoder models

被引:0
|
作者
Akshat Srivastava
Srivatsav Raghu
Abitha K Thyagarajan
Jayasri Vaidyaraman
Mohanaprasad Kothandaraman
Pavan Sudheendra
Avinav Goel
机构
[1] VIT University,School of Electronics Engineering (SENSE)
[2] Samsung R&D Institute,undefined
来源
关键词
Alpha matting; Image segmentation; Deep learning; Encoder-decoder models;
D O I
暂无
中图分类号
学科分类号
摘要
Image matting is a technique used to extract the foreground and background from a given image. In the past, classical algorithms based on sampling, propagation, or a combination of the two were used to perform image matting; however, most of these have produced poor results when applied to images with complex backgrounds. They are also unable to extract with high accuracy foreground images that are comprised of thin objects. In this context, the use of deep learning to solve the image matting problem has gained increasing popularity. In this paper, an encoder-decoder model for alpha matting of human portraits using deep learning is proposed. The model used comprises two parts: the first is an encoder-decoder model, which is a deep convolutional network that has 11 convolutional layers and 5 max-pooling layers in the encoder stage and 11 convolutional layers and 5 unpooling layers in the decoder stage. This portion of the model takes the image and trimap as input produces the coarse alpha matte as the output. The second part is the refinement stage with four convolutional layers, responsible for further refining the coarse alpha matte that was produced by the encoder-decoder stage to obtain an alpha matte of high accuracy. The model was trained using 43,100 images. When tested using the alphamatting.com dataset, our model’s output was comparable to the industry standard, yielding an average MSE of 0.023 and an average SAD loss of 66.5.
引用
收藏
页码:14517 / 14528
页数:11
相关论文
共 50 条
  • [1] Alpha matting for portraits using encoder-decoder models
    Srivastava, Akshat
    Raghu, Srivatsav
    Thyagarajan, Abitha K.
    Vaidyaraman, Jayasri
    Kothandaraman, Mohanaprasad
    Sudheendra, Pavan
    Goel, Avinav
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 14517 - 14528
  • [2] LegoNN: Building Modular Encoder-Decoder Models
    Dalmia, Siddharth
    Okhonko, Dmytro
    Lewis, Mike
    Edunov, Sergey
    Watanabe, Shinji
    Metze, Florian
    Zettlemoyer, Luke
    Mohamed, Abdelrahman
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3112 - 3126
  • [3] Ensemble Encoder-Decoder Models for Predicting Land Transformation
    Pourmohammadi, Pariya
    Strager, Michael P.
    Adjeroh, Donald A.
    IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, 14 : 11429 - 11438
  • [4] KILM: Knowledge Injection into Encoder-Decoder Language Models
    Xu, Yan
    Namazifar, Mahdi
    Hazarika, Devamanyu
    Padmakumar, Aishwarya
    Liu, Yang
    Hakkani-Tur, Dilek
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5013 - 5035
  • [5] Encoder-decoder models for latent phonological representations of words
    Jacobs, Cassandra L.
    Mailhot, Frederic
    16TH SIGMORPHON WORKSHOP ON COMPUTATIONAL RESEARCH IN PHONETICS PHONOLOGY, AND MORPHOLOGY (SIGMORPHON 2019), 2019, : 206 - 217
  • [6] Ensemble Encoder-Decoder Models for Predicting Land Transformation
    Pourmohammadi, Pariya
    Strager, Michael P.
    Adjeroh, Donald A.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 11429 - 11438
  • [7] On Mining Conditions using Encoder-decoder Networks
    Gallego, Fernando O.
    Corchuelo, Rafael
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 624 - 630
  • [8] Confidence measures in encoder-decoder models for speech recognition
    Woodward, Alejandro
    Bonnin, Clara
    Masuda, Issey
    Varas, David
    Bou-Balust, Elisenda
    Riveiro, Juan Carlos
    INTERSPEECH 2020, 2020, : 611 - 615
  • [9] Variational Memory Encoder-Decoder
    Hung Le
    Truyen Tran
    Thin Nguyen
    Venkatesh, Svetha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Analytical study of the encoder-decoder models for ultrasound image segmentation
    Somya Srivastava
    Ankit Vidyarthi
    Shikha Jain
    Service Oriented Computing and Applications, 2024, 18 : 81 - 100