Deep Convolutional Network Cascade for Facial Point Detection

Cited by: 883
Authors
Sun, Yi [1 ]
Wang, Xiaogang [2 ,3 ]
Tang, Xiaoou [1 ,3 ]
Affiliations
[1] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
DOI
10.1109/CVPR.2013.446
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a new approach for estimating the positions of facial keypoints with carefully designed three-level convolutional networks. At each level, the outputs of multiple networks are fused for robust and accurate estimation. Thanks to the deep structures of convolutional networks, global high-level features are extracted over the whole face region at the initialization stage, which helps to locate keypoints with high accuracy. This has two advantages. First, the texture context information over the entire face is utilized to locate each keypoint. Second, since the networks are trained to predict all the keypoints simultaneously, the geometric constraints among keypoints are implicitly encoded. The method can therefore avoid local minima caused by ambiguity and data corruption in difficult image samples due to occlusions, large pose variations, and extreme lighting. The networks at the following two levels are trained to locally refine the initial predictions, and their inputs are limited to small regions around the initial predictions. Several network structures critical for accurate and robust facial point detection are investigated. Extensive experiments show that our approach outperforms state-of-the-art methods in both detection accuracy and reliability.
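The coarse-to-fine flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the predictor interfaces, the `half_size` patch parameter, and the choice of plain averaging as the fusion rule are all assumptions made here for illustration, and the networks themselves are stand-in callables.

```python
from statistics import fmean
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, float]
# Hypothetical interface: a level-1 network maps a face image to one (x, y) per keypoint.
GlobalPredictor = Callable[[object], List[Point]]
# Hypothetical interface: a refinement network maps (image, current point, patch half-size)
# to a corrected (x, y) for that single keypoint.
LocalPredictor = Callable[[object, Point, float], Point]


def fuse(estimates: Sequence[Point]) -> Point:
    """Fuse several estimates of one keypoint by averaging (assumed fusion rule)."""
    return (fmean(p[0] for p in estimates), fmean(p[1] for p in estimates))


def cascade(
    image: object,
    global_nets: Sequence[GlobalPredictor],
    refine_levels: Sequence[Tuple[Sequence[LocalPredictor], float]],
) -> List[Point]:
    """Run a coarse-to-fine cascade and return one fused point per keypoint."""
    # Level 1: every network predicts all keypoints from the whole face region,
    # then the estimates are fused keypoint by keypoint.
    per_net = [net(image) for net in global_nets]
    points = [fuse(group) for group in zip(*per_net)]

    # Later levels: each keypoint is refined from a small region around its
    # current estimate, again fusing the outputs of several networks.
    for nets, half_size in refine_levels:
        points = [fuse([net(image, p, half_size) for net in nets]) for p in points]
    return points
```

To try it, pass any callables with the assumed signatures, for example two functions returning fixed keypoint lists as `global_nets` and a list of `(networks, half_size)` pairs with shrinking `half_size` as `refine_levels`.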
Pages: 3476-3483
Number of pages: 8