An end-to-end face parsing model using channel and spatial attentions

被引:5
|
作者
Kim, Hyungjoon [1 ]
Kim, Hyeonwoo [1 ]
Cho, Seongkuk [1 ]
Hwang, Eenjun [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Face parsing; Attention mechanism; Image segmentation; FACIAL LANDMARK DETECTION;
D O I
10.1016/j.measurement.2022.110807
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Facial image parsing requires accurate extraction of facial components and features, and image segmentation can be used. Recently, various attention mechanisms showed excellent performance in segmentation by extracting features based on spatial and channel relationships for input images. In this paper, we propose a new face parsing technique using an attention block that combines the spatial attention block and the channel attention block to effectively utilize their functions. In this process, we improve the structure of the two blocks to compensate for their weaknesses. The attention block extracts features related to the shape of facial components from spatial relationships and concentrates on more important channels from correlation among channels. We built several segmentation models using the proposed block and compared their performance with well-known segmentation models. Experimental results showed that our combined block-based model can improve the segmentation accuracy by more than 5% in F1 score compared to other models.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] END-TO-END LEARNING OF PARSING MODELS FOR INFORMATION RETRIEVAL
    Gillenwater, Jennifer
    He, Xiaodong
    Gao, Jianfeng
    Deng, Li
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3312 - 3316
  • [12] End-to-End Argument Mining as Biaffine Dependency Parsing
    Ye, Yuxiao
    Teufel, Simone
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 669 - 678
  • [13] End-to-End One-Shot Human Parsing
    He, Haoyu
    Zhang, Jing
    Zhuang, Bohan
    Cai, Jianfei
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14481 - 14496
  • [14] End-to-End Learning of Communications Systems Without a Channel Model
    Aoudia, Faycal Ait
    Hoydis, Jakob
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 298 - 303
  • [15] End-to-end consensus using end-to-end channels
    Wiesmann, Matthias
    Defago, Xavier
    12TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2006, : 341 - +
  • [16] Table Structure Recognition and Form Parsing by End-to-End Object Detection and Relation Parsing
    Li, Xiao-Hui
    Yin, Fei
    Dai, He-Sen
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2022, 132
  • [17] SPRING Goes Online: End-to-End AMR Parsing and Generation
    Blloshmi, Rexhina
    Bevilacqua, Michele
    Fabiano, Edoardo
    Caruso, Valentina
    Navigli, Roberto
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2021, : 134 - 142
  • [18] END-TO-END PART-LEVEL ACTION PARSING WITH TRANSFORMER
    Chen, Xiaojia
    Wang, Xuanhan
    Chen, Beitao
    Gao, Lianli
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 756 - 761
  • [19] Using Multiple Masks to Improve End-to-End Face Recognition Performance
    Neylan, Christopher A.
    Salgian, Andrea
    ADVANCES IN VISUAL COMPUTING, PT II, PROCEEDINGS, 2008, 5359 : 329 - 335
  • [20] End-to-End FusVAE for Face Image Fusion
    Li, Xiang
    Chen, Bo
    Wen, Meijin
    Wang, Haoshuang
    2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,