The Elements of End-to-end Deep Face Recognition: A Survey of Recent Advances

Cited by: 53

Authors:

Du, Hang [1]

Shi, Hailin [2]

Zeng, Dan [1]

Zhang, Xiao-Ping [3]

Mei, Tao [2]

Affiliations:

[1] Shanghai Univ, 99 Shangda Rd BaoShan Dist, Shanghai 200444, Peoples R China

[2] JD AI Res, Beijing, Peoples R China

[3] Ryerson Univ, Toronto, ON, Canada

Source:

ACM COMPUTING SURVEYS | 2022 / Vol. 54 / Issue 10S

Keywords:

Deep learning; convolutional neural network; face recognition; face detection; face alignment; face representation; REPRESENTATION; NETWORK; 3D; CLASSIFICATION; EIGENFACES; FEATURES;

DOI:

10.1145/3507902

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

Face recognition (FR) is one of the most popular and long-standing topics in computer vision. With the recent development of deep learning techniques and large-scale datasets, deep face recognition has made remarkable progress and is now widely used in real-world applications. Given a natural image or video frame as input, an end-to-end deep face recognition system outputs the face feature used for recognition. A typical end-to-end system is built from three key elements: face detection, face alignment, and face representation. Face detection locates the faces in the image or frame. Face alignment then calibrates each face to a canonical view and crops it to a normalized pixel size. Finally, in the face representation stage, discriminative features are extracted from the aligned face for recognition. Today, all three elements are implemented with deep convolutional neural networks. In this survey article, we present a comprehensive review of recent advances in each element of end-to-end deep face recognition, since thriving deep learning techniques have greatly improved the capability of each. We begin with an overview of end-to-end deep face recognition. We then review the advances in each element in turn, covering up-to-date algorithm designs, evaluation metrics, datasets, performance comparisons, existing challenges, and promising directions for future research. We also provide a detailed discussion of the effect of each element on its subsequent elements and on the holistic system. Through this survey, we hope to contribute in two ways: first, readers can conveniently identify the methods that serve as strong baselines in each subcategory for further exploration; second, one can select suitable methods to build a state-of-the-art end-to-end face recognition system from scratch.
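
The three-element pipeline described in the abstract (detection, then alignment, then representation) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the survey: the function names (detect_faces, align_face, represent_face, extract_features, cosine_similarity) are hypothetical, and each stage is a trivial placeholder standing in for what would be a deep convolutional network in a real system. The sketch only shows how the output of each stage feeds the next.

```python
import math
from typing import Callable, List, Tuple

Image = List[List[float]]        # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]  # (top, left, height, width)


def detect_faces(image: Image) -> List[Box]:
    """Placeholder detector (assumption): treats the whole image as one face."""
    return [(0, 0, len(image), len(image[0]))]


def align_face(image: Image, box: Box, size: int = 4) -> Image:
    """Placeholder alignment (assumption): crop the box and resize it to
    size x size with nearest-neighbour sampling. A real system would also
    warp the face to a canonical view."""
    top, left, height, width = box
    return [
        [image[top + (r * height) // size][left + (c * width) // size]
         for c in range(size)]
        for r in range(size)
    ]


def represent_face(face: Image) -> List[float]:
    """Placeholder representation (assumption): L2-normalised flattened pixels."""
    flat = [p for row in face for p in row]
    norm = math.sqrt(sum(p * p for p in flat)) or 1.0
    return [p / norm for p in flat]


def extract_features(
    image: Image,
    detect: Callable[[Image], List[Box]] = detect_faces,
    align: Callable[[Image, Box], Image] = align_face,
    represent: Callable[[Image], List[float]] = represent_face,
) -> List[List[float]]:
    """Run detection, alignment, and representation in sequence and
    return one feature vector per detected face."""
    return [represent(align(image, box)) for box in detect(image)]


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Dot product of two L2-normalised feature vectors."""
    return sum(x * y for x, y in zip(a, b))


# Usage: an 8x8 synthetic image yields one 16-dimensional unit-length feature.
image = [[float(r + c + 1) for c in range(8)] for r in range(8)]
features = extract_features(image)
```

Passing the stages as arguments keeps each element replaceable on its own, which mirrors the abstract's point that each element affects the elements after it and the system as a whole.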


Pages: 42
