The Elements of End-to-end Deep Face Recognition: A Survey of Recent Advances

被引:53
|
作者
Du, Hang [1 ]
Shi, Hailin [2 ]
Zeng, Dan [1 ]
Zhang, Xiao-Ping [3 ]
Mei, Tao [2 ]
机构
[1] Shanghai Univ, 99 Shangda Rd BaoShan Dist, Shanghai 200444, Peoples R China
[2] JD AI Res, Beijing, Peoples R China
[3] Ryerson Univ, Toronto, ON, Canada
关键词
Deep learning; convolutional neural network; face recognition; face detection; face alignment; face representation; REPRESENTATION; NETWORK; 3D; CLASSIFICATION; EIGENFACES; FEATURES;
D O I
10.1145/3507902
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Face recognition (FR) is one of the most popular and long-standing topics in computer vision. With the recent development of deep learning techniques and large-scale datasets, deep face recognition has made remarkable progress and has been widely used in many real-world applications. Given a natural image or video frame as input, an end-to-end deep face recognition system outputs the face feature for recognition. To achieve this, a typical end-to-end system is built with three key elements: face detection, face alignment, and face representation. Face detection locates faces in the image or frame. Then, the face alignment is proceeded to calibrate the faces to the canonical view and crop them with a normalized pixel size. Finally, in the stage of face representation, the discriminative features are extracted from the aligned face for recognition. Nowadays, all of the three elements are fulfilled by the technique of deep convolutional neural network. In this survey article, we present a comprehensive review about the recent advance of each element of the end-to-end deep face recognition, since the thriving deep learning techniques have greatly improved their capability of them. To start with, we present an overview of the end-to-end deep face recognition. Then, we review the advance of each element, respectively, covering many aspects such as the to-date algorithm designs, evaluation metrics, datasets, performance comparison, existing challenges, and promising directions for future research. Also, we provide a detailed discussion about the effect of each element on its subsequent elements and the holistic system. Through this survey, we wish to bring contributions in two aspects: first, readers can conveniently identify the methods which are quite strong-baseline style in the subcategory for further exploration; second, one can also employ suitable methods for establishing a state-of-the-art end-to-end face recognition system from scratch.
引用
收藏
页数:42
相关论文
共 50 条
  • [1] Recent Advances in End-to-End Automatic Speech Recognition
    Li, Jinyu
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2022, 11 (01)
  • [2] Recent Advancements in End-to-End Autonomous Driving Using Deep Learning: A Survey
    Chib, Pranav Singh
    Singh, Pravendra
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 103 - 118
  • [3] FACE DETECTION AND RECOGNITION FOR HOME SERVICE ROBOTS WITH END-TO-END DEEP NEURAL NETWORKS
    Jiang, Wei
    Wang, Wei
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2232 - 2236
  • [4] Deep Covariance Feature and CNN-based End-to-End Masked Face Recognition
    Junayed, Masum Shah
    Sadeghzadeh, Arezoo
    Islam, Md Baharul
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [5] An End-to-End Face Recognition System Evaluation Framework
    West Virginia University
  • [6] An end-to-end face recognition method with alignment learning
    Tang, Fenggao
    Wu, Xuedong
    Zhu, Zhiyu
    Wan, Zhengang
    Chang, Yanchao
    Du, Zhaoping
    Gu, Lili
    OPTIK, 2020, 205
  • [7] End-to-End Spatial Transform Face Detection and Recognition
    Zhang H.
    Chi L.
    Virtual Reality and Intelligent Hardware, 2020, 2 (02): : 119 - 131
  • [8] End-to-End Deep Learning for Driver Distraction Recognition
    Koesdwiady, Arief
    Bedawi, Safaa M.
    Ou, Chaojie
    Karray, Fakhri
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2017, 2017, 10317 : 11 - 18
  • [9] DEEP CONTEXT: END-TO-END CONTEXTUAL SPEECH RECOGNITION
    Pundak, Golan
    Sainath, Tara N.
    Prabhavalkar, Rohit
    Kannan, Anjuli
    Zhao, Ding
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 418 - 425
  • [10] End-to-End Protocols and Performance Metrics For Unconstrained Face Recognition
    Duncan, James A.
    Kalka, Nathan D.
    Maze, Brianna
    Jain, Anil K.
    2019 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2019,