Alpha matting for portraits using encoder-decoder models

Cited by: 0
Authors
Akshat Srivastava
Srivatsav Raghu
Abitha K Thyagarajan
Jayasri Vaidyaraman
Mohanaprasad Kothandaraman
Pavan Sudheendra
Avinav Goel
Affiliations
[1] VIT University, School of Electronics Engineering (SENSE)
[2] Samsung R&D Institute
Keywords
Alpha matting; Image segmentation; Deep learning; Encoder-decoder models
DOI
Not available
Abstract
Image matting is a technique used to separate the foreground from the background of a given image. In the past, classical algorithms based on sampling, propagation, or a combination of the two were used to perform image matting; however, most of these produce poor results when applied to images with complex backgrounds. They are also unable to accurately extract foregrounds composed of thin objects. In this context, the use of deep learning to solve the image matting problem has gained increasing popularity. In this paper, an encoder-decoder model for alpha matting of human portraits using deep learning is proposed. The model comprises two parts. The first is an encoder-decoder model, a deep convolutional network with 11 convolutional layers and 5 max-pooling layers in the encoder stage and 11 convolutional layers and 5 unpooling layers in the decoder stage. This portion of the model takes the image and trimap as input and produces a coarse alpha matte as output. The second part is a refinement stage with four convolutional layers, which further refines the coarse alpha matte produced by the encoder-decoder stage to obtain an alpha matte of high accuracy. The model was trained using 43,100 images. When tested using the alphamatting.com dataset, our model’s output was comparable to the industry standard, yielding an average MSE of 0.023 and an average SAD loss of 66.5.
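The two figures reported at the end of the abstract, mean squared error (MSE) and sum of absolute differences (SAD), can be illustrated with a minimal sketch. This is not the authors' evaluation code. The function name `matte_errors`, the optional `unknown` mask, and the convention of alpha values in [0, 1] are assumptions made for illustration; the abstract does not say how SAD is scaled or whether scoring is limited to the unknown region of the trimap.

```python
from typing import List, Optional, Tuple

Matte = List[List[float]]  # alpha values in [0, 1], stored row by row


def matte_errors(
    pred: Matte,
    truth: Matte,
    unknown: Optional[List[List[bool]]] = None,
) -> Tuple[float, float]:
    """Return (mse, sad) between a predicted and a ground-truth alpha matte.

    If `unknown` is given, only pixels where it is True are scored.
    Assumption: this mimics scoring the trimap's unknown region only.
    """
    sq_sum = 0.0   # running sum of squared differences
    abs_sum = 0.0  # running sum of absolute differences (SAD)
    count = 0      # number of pixels actually scored
    for r, (p_row, t_row) in enumerate(zip(pred, truth)):
        for c, (p, t) in enumerate(zip(p_row, t_row)):
            if unknown is not None and not unknown[r][c]:
                continue
            diff = p - t
            sq_sum += diff * diff
            abs_sum += abs(diff)
            count += 1
    if count == 0:
        raise ValueError("no pixels selected for evaluation")
    # MSE is averaged over the scored pixels; SAD is left as a raw sum.
    return sq_sum / count, abs_sum


if __name__ == "__main__":
    pred = [[0.0, 0.5], [1.0, 1.0]]
    truth = [[0.0, 1.0], [1.0, 0.5]]
    print(matte_errors(pred, truth))
```

MSE is normalized by the number of scored pixels, while SAD grows with image size, so SAD values are only comparable between evaluations that use the same resolution and scaling.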
Pages: 14517-14528
Page count: 11