3-D Facial Landmarks Detection for Intelligent Video Systems

Cited by: 14
Authors
Hoang, Van-Thanh [1]
Huang, De-Shuang [2]
Jo, Kang-Hyun [3, 4]
Affiliations
[1] Univ Ulsan, Grad Sch Elect Engn, Elect & Comp Engn, Ulsan 44610, South Korea
[2] Tongji Univ, Sch Elect & Informat Engn, Inst Machine Learning & Syst Biol, Shanghai 201804, Peoples R China
[3] Tongji Univ, Shanghai, Peoples R China
[4] Univ Ulsan, Sch Elect Engn, Ulsan, South Korea
Keywords
Face; Three-dimensional displays; Detectors; Computer architecture; Convolution; Task analysis; Computational modeling; Convolution block; convolutional neural network (CNN); facial landmarks; stacked hourglass
DOI
10.1109/TII.2020.2966513
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Facial landmark detection is a fundamental research topic in computer vision and is widely used in many applications. Recent progress in convolutional neural networks has greatly improved its accuracy. This article proposes a facial landmark detector, based on the stacked hourglass network, a state-of-the-art architecture for landmark localization, to obtain accurate facial landmark points. More specifically, it uses residual networks as the backbone instead of a 7 x 7 convolution layer. It also modifies the hourglass modules, replacing the original residual blocks with residual-dense blocks in the mainstream to capture more efficient features, and with 1 x 1 convolution layers in the branch streams to reduce model size and computation time. The proposed architecture further enhances the features from the modified hourglass modules with finer-resolution features via a lateral connection to generate more accurate results. The proposed network outperforms other state-of-the-art methods on the AFLW2000-3D dataset and on the LS3D-W dataset, the largest three-dimensional (3-D) face alignment dataset to date.
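As a rough illustration of why 1 x 1 convolution layers in the branch streams can reduce model size, the sketch below counts learnable parameters for a single 1 x 1 convolution and for a bottleneck-style residual block. The channel width of 256, the 1 x 1 / 3 x 3 / 1 x 1 bottleneck layout, and the helper names are assumptions made for this example; the abstract does not state them, and normalization layers are ignored.

```python
def conv_params(kernel: int, c_in: int, c_out: int, bias: bool = True) -> int:
    """Learnable parameters of a 2-D convolution with a square kernel."""
    weights = kernel * kernel * c_in * c_out
    return weights + (c_out if bias else 0)


def bottleneck_residual_params(channels: int) -> int:
    """Parameters of an assumed bottleneck residual block:
    1x1 (C -> C/2), 3x3 (C/2 -> C/2), 1x1 (C/2 -> C).
    This layout is an assumption, not taken from the abstract."""
    half = channels // 2
    return (
        conv_params(1, channels, half)
        + conv_params(3, half, half)
        + conv_params(1, half, channels)
    )


if __name__ == "__main__":
    channels = 256  # hypothetical feature width
    residual = bottleneck_residual_params(channels)
    pointwise = conv_params(1, channels, channels)
    print(f"bottleneck residual block: {residual} parameters")
    print(f"single 1x1 convolution:    {pointwise} parameters")
    print(f"ratio: {residual / pointwise:.2f}x")
```

Under these assumptions the single 1 x 1 layer needs roughly a third of the parameters of the residual block, which is consistent with the abstract's stated motivation of reducing model size and computation time.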
Pages: 578-586
Page count: 9