End-to-end face parsing via interlinked convolutional neural networks

被引:22
|
作者
Yin, Zi [1 ]
Yiu, Valentin [2 ,3 ]
Hu, Xiaolin [2 ]
Tang, Liang [1 ]
机构
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Inst Artificial Intelligence,THBI,Dept Comp Sci &, Beijing 100084, Peoples R China
[3] Cent Supelec, F-91190 Gif Sur Yvette, France
基金
中国国家自然科学基金;
关键词
STN-iCNN; Face parsing; End-to-end;
D O I
10.1007/s11571-020-09615-4
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Face parsing is an important computer vision task that requires accurate pixel segmentation of facial parts (such as eyes, nose, mouth, etc.), providing a basis for further face analysis, modification, and other applications. Interlinked Convolutional Neural Networks (iCNN) was proved to be an effective two-stage model for face parsing. However, the original iCNN was trained separately in two stages, limiting its performance. To solve this problem, we introduce a simple, end-to-end face parsing framework: STN-aided iCNN(STN-iCNN), which extends the iCNN by adding a Spatial Transformer Network (STN) between the two isolated stages. The STN-iCNN uses the STN to provide a trainable connection to the original two-stage iCNN pipeline, making end-to-end joint training possible. Moreover, as a by-product, STN also provides more precise cropped parts than the original cropper. Due to these two advantages, our approach significantly improves the accuracy of the original model. Our model achieved competitive performance on the Helen Dataset, the standard face parsing dataset. It also achieved superior performance on CelebAMask-HQ dataset, proving its good generalization. Our code has been released at https://github.com/aod321/STN-iCNN.
引用
收藏
页码:169 / 179
页数:11
相关论文
共 50 条
  • [21] CONVOLUTIONAL ANALYSIS OPERATOR LEARNING BY END-TO-END TRAINING OF ITERATIVE NEURAL NETWORKS
    Kofler, Andreas
    Wald, Christian
    Schaeffter, Tobias
    Haltmeier, Markus
    Kolbitsch, Christoph
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (IEEE ISBI 2022), 2022,
  • [22] Towards End-to-End Speech Recognition with Deep Multipath Convolutional Neural Networks
    Zhang, Wei
    Zhai, Minghao
    Huang, Zilong
    Liu, Chen
    Li, Wei
    Cao, Yi
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PART VI, 2019, 11745 : 332 - 341
  • [23] Towards end-to-end likelihood-free inference with convolutional neural networks
    Radev, Stefan T.
    Mertens, Ulf K.
    Voss, Andreas
    Koethe, Ullrich
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2020, 73 (01): : 23 - 43
  • [24] A new end-to-end image compression system based on convolutional neural networks
    Akyazi, Pinar
    Ebrahimi, Touradj
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
  • [25] End-to-End Wireframe Parsing
    Zhou, Yichao
    Qi, Haozhi
    Ma, Yi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 962 - 971
  • [26] An end-to-end face parsing model using channel and spatial attentions
    Kim, Hyungjoon
    Kim, Hyeonwoo
    Cho, Seongkuk
    Hwang, Eenjun
    MEASUREMENT, 2022, 191
  • [27] End-to-end 3D face reconstruction with deep neural networks
    Dou, Pengfei
    Shah, Shishir K.
    Kakadiaris, Ioannis A.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1503 - 1512
  • [28] End-to-End Partial Discharge Detection in Power Cables via Time-Domain Convolutional Neural Networks
    Mohammad Azam Khan
    Jaegul Choo
    Yong-Hwa Kim
    Journal of Electrical Engineering & Technology, 2019, 14 : 1299 - 1309
  • [29] End-to-End Partial Discharge Detection in Power Cables via Time-Domain Convolutional Neural Networks
    Khan, Mohammad Azam
    Choo, Jaegul
    Kim, Yong-Hwa
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2019, 14 (03) : 1299 - 1309
  • [30] Firearm Detection via Convolutional Neural Networks: Comparing a Semantic Segmentation Model Against End-to-End Solutions
    Egiazarov, Alexander
    Zennaro, Fabio Massimo
    Mavroeidis, Vasileios
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1796 - 1804