End-to-end face parsing via interlinked convolutional neural networks

被引:22
|
作者
Yin, Zi [1 ]
Yiu, Valentin [2 ,3 ]
Hu, Xiaolin [2 ]
Tang, Liang [1 ]
机构
[1] Beijing Forestry Univ, Sch Technol, Beijing 100083, Peoples R China
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Inst Artificial Intelligence,THBI,Dept Comp Sci &, Beijing 100084, Peoples R China
[3] Cent Supelec, F-91190 Gif Sur Yvette, France
基金
中国国家自然科学基金;
关键词
STN-iCNN; Face parsing; End-to-end;
D O I
10.1007/s11571-020-09615-4
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Face parsing is an important computer vision task that requires accurate pixel segmentation of facial parts (such as eyes, nose, mouth, etc.), providing a basis for further face analysis, modification, and other applications. Interlinked Convolutional Neural Networks (iCNN) was proved to be an effective two-stage model for face parsing. However, the original iCNN was trained separately in two stages, limiting its performance. To solve this problem, we introduce a simple, end-to-end face parsing framework: STN-aided iCNN(STN-iCNN), which extends the iCNN by adding a Spatial Transformer Network (STN) between the two isolated stages. The STN-iCNN uses the STN to provide a trainable connection to the original two-stage iCNN pipeline, making end-to-end joint training possible. Moreover, as a by-product, STN also provides more precise cropped parts than the original cropper. Due to these two advantages, our approach significantly improves the accuracy of the original model. Our model achieved competitive performance on the Helen Dataset, the standard face parsing dataset. It also achieved superior performance on CelebAMask-HQ dataset, proving its good generalization. Our code has been released at https://github.com/aod321/STN-iCNN.
引用
收藏
页码:169 / 179
页数:11
相关论文
共 50 条
  • [1] End-to-end face parsing via interlinked convolutional neural networks
    Zi Yin
    Valentin Yiu
    Xiaolin Hu
    Liang Tang
    Cognitive Neurodynamics, 2021, 15 : 169 - 179
  • [2] Interlinked Convolutional Neural Networks for Face Parsing
    Zhou, Yisu
    Hu, Xiaolin
    Zhang, Bo
    ADVANCES IN NEURAL NETWORKS - ISNN 2015, 2015, 9377 : 222 - 231
  • [3] An End-to-End System for Unconstrained Face Verification with Deep Convolutional Neural Networks
    Chen, Jun-Cheng
    Ranjan, Rajeev
    Kumar, Amit
    Chen, Ching-Hui
    Patel, Vishal M.
    Chellappa, Rama
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 360 - 368
  • [4] A Multi-Task Framework for Facial Attributes Classification through End-to-End Face Parsing and Deep Convolutional Neural Networks
    Khan, Khalil
    Attique, Muhammad
    Khan, Rehan Ullah
    Syed, Ikram
    Chung, Tae-Sun
    SENSORS, 2020, 20 (02)
  • [5] Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks
    Lu, Yan
    Fan, Haoyi
    Li, Zuoyong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 191 - 200
  • [6] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Junho Jo
    Hyung Il Koo
    Jae Woong Soh
    Nam Ik Cho
    Multimedia Tools and Applications, 2020, 79 : 32137 - 32150
  • [7] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Jo, Junho
    Koo, Hyung Il
    Soh, Jae Woong
    Cho, Nam Ik
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32137 - 32150
  • [8] End-to-End Text Recognition with Convolutional Neural Networks
    Wang, Tao
    Wu, David J.
    Coates, Adam
    Ng, Andrew Y.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
  • [9] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Jiang, Feng
    Tao, Wen
    Liu, Shaohui
    Ren, Jie
    Guo, Xun
    Zhao, Debin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 3007 - 3018
  • [10] An End-to-End Compression Framework Based on Convolutional Neural Networks
    Tao, Wen
    Jiang, Feng
    Zhang, Shengping
    Ren, Jie
    Shi, Wuzhen
    Zuo, Wangmeng
    Guo, Xun
    Zhao, Debin
    2017 DATA COMPRESSION CONFERENCE (DCC), 2017, : 463 - 463