Real-Time Head Orientation from a Monocular Camera Using Deep Neural Network

被引:44
|
作者
Ahn, Byungtae [1 ]
Park, Jaesik [1 ]
Kweon, In So [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
来源
关键词
D O I
10.1007/978-3-319-16811-1_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose an efficient and accurate head orientation estimation algorithm using a monocular camera. Our approach is leveraged by deep neural network and we exploit the architecture in a data regression manner to learn the mapping function between visual appearance and three dimensional head orientation angles. Therefore, in contrast to classification based approaches, our system outputs continuous head orientation. The algorithm uses convolutional filters trained with a large number of augmented head appearances, thus it is user independent and covers large pose variations. Our key observation is that an input image having 32 x 32 resolution is enough to achieve about 3 degrees of mean square error, which can be used for efficient head orientation applications. Therefore, our architecture takes only 1ms on roughly localized head positions with the aid of GPU. We also propose particle filter based post-processing to enhance stability of the estimation further in video sequences. We compare the performance with the state-of-the-art algorithm which utilizes depth sensor and we validate our head orientation estimator on Internet photos and video.
引用
收藏
页码:82 / 96
页数:15
相关论文
共 50 条
  • [31] Semiparallel deep neural network hybrid architecture: first application on depth from monocular camera
    Bazrafkan, Shabab
    Javidnia, Hossein
    Lemley, Joseph
    Corcoran, Peter
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (04)
  • [32] Real-time MRI lungs images revealing using Hybrid feedforward Deep Neural Network and Convolutional Neural Network
    Karthick, M.
    Samuel, Dinesh Jackson
    Prakash, B.
    Sathyaprakash, P.
    Daruvuri, Nandhini
    Ali, Mohammed Hasan
    Aiswarya, R. S.
    INTELLIGENT DATA ANALYSIS, 2023, 27 : S95 - S114
  • [33] Real-time head tracking from the deformation of eye contours using a piecewise affine camera
    Colombo, C
    Del Bimbo, A
    PATTERN RECOGNITION LETTERS, 1999, 20 (07) : 721 - 730
  • [34] Real-time detection of rice phenology through convolutional neural network using handheld camera images
    Han, Jingye
    Shi, Liangsheng
    Yang, Qi
    Huang, Kai
    Zha, Yuanyuan
    Yu, Jin
    PRECISION AGRICULTURE, 2021, 22 (01) : 154 - 178
  • [35] Real-time camera-based face detection using a modified LAMSTAR neural network system
    Girado, JI
    Sandin, DJ
    DeFanti, TA
    Wolf, LK
    APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN IMAGE PROCESSING VII, 2003, 5015 : 36 - 46
  • [36] Comparing Monocular Camera Depth Estimation Models for Real-time Applications
    Diab, Abdelrahman
    Sabry, Mohamed
    El Mougy, Amr
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 673 - 680
  • [37] Real-Time 3D Pedestrian Tracking with Monocular Camera
    Xiao, Peng
    Yan, Fei
    Chi, Jiannan
    Wang, Zhiliang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [38] Real-time target tracking with particle filter in moving monocular camera
    Liu, Guocheng
    Wang, Yongji
    MIPPR 2007: AUTOMATIC TARGET RECOGNITION AND IMAGE ANALYSIS; AND MULTISPECTRAL IMAGE ACQUISITION, PTS 1 AND 2, 2007, 6786
  • [39] Real-time detection of rice phenology through convolutional neural network using handheld camera images
    Jingye Han
    Liangsheng Shi
    Qi Yang
    Kai Huang
    Yuanyuan Zha
    Jin Yu
    Precision Agriculture, 2021, 22 : 154 - 178
  • [40] Real-Time People Tracking in a Camera Network
    Limprasert, Wasit
    Wallace, Andrew
    Michaelson, Greg
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2013, 3 (02) : 263 - 271