3-D Facial Landmarks Detection for Intelligent Video Systems

Cited by: 14
Authors
Hoang, Van-Thanh [1]
Huang, De-Shuang [2]
Jo, Kang-Hyun [3, 4]
Affiliations
[1] Univ Ulsan, Grad Sch Elect Engn, Elect & Comp Engn, Ulsan 44610, South Korea
[2] Tongji Univ, Sch Elect & Informat Engn, Inst Machine Learning & Syst Biol, Shanghai 201804, Peoples R China
[3] Tongji Univ, Shanghai, Peoples R China
[4] Univ Ulsan, Sch Elect Engn, Ulsan, South Korea
Keywords
Face; Three-dimensional displays; Detectors; Computer architecture; Convolution; Task analysis; Computational modeling; Convolution block; convolutional neural network (CNN); facial landmarks; stacked hourglass
DOI
10.1109/TII.2020.2966513
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Facial landmark detection is a fundamental research topic in computer vision that is widely adopted in many applications. Recently, thanks to the development of convolutional neural networks, performance on this task has improved considerably. This article proposes a facial-landmark detector, based on a state-of-the-art landmark-localization architecture called the stacked hourglass network, to obtain accurate facial landmark points. More specifically, this article uses residual networks as the backbone instead of a 7 × 7 convolution layer. Additionally, it modifies the hourglass modules by replacing the original residual blocks with residual-dense blocks in the mainstream, to capture more efficient features, and with 1 × 1 convolution layers in the branch streams, to reduce the model size and computational time. The proposed architecture also enhances the features from the modified hourglass modules with finer-resolution features via a lateral connection to generate more accurate results. The proposed network can outperform other state-of-the-art methods on the AFLW2000-3D dataset and on the LS3D-W dataset, the largest three-dimensional (3-D) face alignment dataset to date.
Pages: 578-586
Page count: 9