Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network

被引:44
|
作者
Ahn, Byungtae [1 ]
Park, Jaesik [1 ]
Kweon, In So [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
来源
关键词
D O I
10.1007/978-3-319-16811-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an efficient and accurate head orientation estimation algorithm using a monocular camera. Our approach is leveraged by deep neural network and we exploit the architecture in a data regression manner to learn the mapping function between visual appearance and three dimensional head orientation angles. Therefore, in contrast to classification based approaches, our system outputs continuous head orientation. The algorithm uses convolutional filters trained with a large number of augmented head appearances, thus it is user independent and covers large pose variations. Our key observation is that an input image having 32 x 32 resolution is enough to achieve about 3 degrees of mean square error, which can be used for efficient head orientation applications. Therefore, our architecture takes only 1ms on roughly localized head positions with the aid of GPU. We also propose particle filter based post-processing to enhance stability of the estimation further in video sequences. We compare the performance with the state-of-the-art algorithm which utilizes depth sensor and we validate our head orientation estimator on Internet photos and video.
引用
收藏
页码:82 / 96
页数:15
相关论文
共 50 条
  • [41] Real-Time Forward Collision Warning system using nested Kalman filter for monocular camera
    Lim, Qun
    He, Yichang
    Tan, U-Xuan
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 868 - 873
  • [42] Real-Time Algorithm for Relative Position Estimation Between Person and Robot Using a Monocular Camera
    Lee, Jung Uk
    Sun, Ju Young
    Won, Mooncheol
    TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2013, 37 (12) : 1445 - 1452
  • [43] Multi-task neural network with physical constraint for real-time multi-person 3D pose estimation from monocular camera
    Dingli Luo
    Songlin Du
    Takeshi Ikenaga
    Multimedia Tools and Applications, 2021, 80 : 27223 - 27244
  • [44] Multi-task neural network with physical constraint for real-time multi-person 3D pose estimation from monocular camera
    Luo, Dingli
    Du, Songlin
    Ikenaga, Takeshi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 27223 - 27244
  • [45] Framework for estimating distance and dimension attributes of pedestrians in real-time environments using monocular camera
    Raza, Mudassar
    Chen, Zonghai
    Rehman, Saeed Ur
    Wang, Peng
    Wang, Ji-Kai
    NEUROCOMPUTING, 2018, 275 : 533 - 545
  • [46] Deep Neural Network Based Real-Time Intrusion Detection System
    Sharuka Promodya Thirimanne
    Lasitha Jayawardana
    Lasith Yasakethu
    Pushpika Liyanaarachchi
    Chaminda Hewage
    SN Computer Science, 2022, 3 (2)
  • [47] A Smart Deep Convolutional Neural Network for Real-Time Surface Inspection
    Passos, Adriano G.
    Cousseau, Tiago
    Luersen, Marco A.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 41 (02): : 583 - 593
  • [48] Real-Time Fake News Detection Using Big Data Analytics and Deep Neural Network
    Babar, Muhammad
    Ahmad, Awais
    Tariq, Muhammad Usman
    Kaleem, Sarah
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5189 - 5198
  • [49] Real-time CVSA decals recognition system using deep convolutional neural network architectures
    Yepez, Juan
    Castro-Zunti, Riel
    Choi, Younhee
    Ko, Seok-Bum
    IET INTELLIGENT TRANSPORT SYSTEMS, 2021, 15 (11) : 1359 - 1371
  • [50] TOWARDS REAL-TIME CRACK DETECTION USING A DEEP NEURAL NETWORK WITH A BAYESIAN FUSION ALGORITHM
    Fang, Fen
    Li, Liyuan
    Rice, Mark
    Lim, Joo-Hwee
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2976 - 2980