Combining texture and stereo disparity cues for real-time face detection

被引:5
作者
Jiang, Feijun
Fischer, Mika [1 ]
Ekenel, Hazim Kemal [1 ,2 ]
Shi, Bertram E. [3 ,4 ]
机构
[1] Karlsruhe Inst Technol, Inst Anthropomat, D-76021 Karlsruhe, Germany
[2] Istanbul Tech Univ, Fac Comp & Informat, Istanbul, Turkey
[3] Hong Kong Univ Sci & Technol, Dept ECE, Hong Kong, Hong Kong, Peoples R China
[4] Hong Kong Univ Sci & Technol, Div Biomed Engn, Hong Kong, Hong Kong, Peoples R China
关键词
Multi-view face detection; Stereo vision; Disparity energy model; Gabor filter; CORTEX;
D O I
10.1016/j.image.2013.07.006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Intuitively, integrating information from multiple visual cues, such as texture, stereo disparity, and image motion, should improve performance on perceptual tasks, such as object detection. On the other hand, the additional effort required to extract and represent information from additional cues may increase computational complexity. In this work, we show that using biologically inspired integrated representation of texture and stereo disparity information for a multi-view facial detection task leads to not only improved detection performance, but also reduced computational complexity. Disparity information enables us to filter out 90% of image locations as being less likely to contain faces. Performance is improved because the filtering rejects 32% of the false detections made by a similar monocular detector at the same recall rate. Despite the additional computation required to compute disparity information, our binocular detector takes only 42 ms to process a pair of 640 x 480 images, 35% of the time required by the monocular detector. We also show that this integrated detector is computationally more efficient than a detector with similar performance where texture and stereo information is processed separately. (c) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1100 / 1113
页数:14
相关论文
共 42 条
[1]   SPATIOTEMPORAL ENERGY MODELS FOR THE PERCEPTION OF MOTION [J].
ADELSON, EH ;
BERGEN, JR .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1985, 2 (02) :284-299
[2]  
[Anonymous], 2010, UMCS2010009
[3]  
[Anonymous], STEREO IMAGES LABELE
[4]  
[Anonymous], IEEE ICRA WORKSH PEO
[5]  
[Anonymous], EUR C COMP VIS CIT
[6]  
[Anonymous], TR2000396 MITS EL RE
[7]  
[Anonymous], CALTECH BACKGROUND I
[8]  
[Anonymous], ICCV WORKSH GPU COMP
[9]   Using Depth Information to Improve Face Detection [J].
Burgin, Walker ;
Pantofaru, Caroline ;
Smart, William D. .
PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, :119-120
[10]  
Chen J, 2004, LECT NOTES COMPUT SC, V3338, P90