Real time face detection for multimodal speech recognition

Cited by: 0
Authors
Murai, K [1]
Nakamura, S [1]
Affiliation
[1] Fuji Xerox, Informat Media Lab, Kanagawa 2590157, Japan
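Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract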
In this paper, we propose a real-time system that detects the speaker's frontal face for multimodal speech recognition. It is widely acknowledged that automatic speech recognizers, like humans, can improve recognition performance by adding a visual modality, i.e., the speaker's facial image, to the audio modality [1][2]. The visual modality also provides inaudible information, such as the speaker's facial orientation [3] and the location of the mouth. To acquire this information, the speaker's face must be localized in real time. Our system combines skin color detection with spatial feature detection. Color-based detection is fast but depends on the skin and background colors, while spatial feature detection requires more computation. We therefore apply color-based pruning to reduce the search space for the spatial feature detection. By detecting the facial orientation, the proposed method functions as a "Face to Talk" switch in place of a "Push to Talk" switch. In our experiment, color-based pruning reduced the search space by 53-97%, and the subsequent spatial detector correctly detected 98.9% of the frontal faces.
Pages: A373-A376
Number of pages: 4
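
The two-stage idea in the abstract, cheap color-based pruning followed by a costlier spatial detector run only on the surviving windows, can be sketched roughly as below. This is a minimal illustration, not the authors' implementation: the normalized r-g chromaticity thresholds, the window size, the stride, the minimum skin ratio, and the names `is_skin`, `candidate_windows`, and `detect_faces` are all assumptions, and the spatial detector is left as a caller-supplied function.

```python
# Illustrative sketch only. All thresholds, sizes, and names are assumptions
# and are not taken from the paper.

def is_skin(r, g, b):
    """Crude skin test in normalized r-g chromaticity space (hypothetical thresholds)."""
    total = r + g + b
    if total == 0:
        return False
    rn, gn = r / total, g / total
    return 0.36 <= rn <= 0.56 and 0.26 <= gn <= 0.36


def skin_mask(image):
    """Return a boolean mask with True where a pixel passes the skin test."""
    return [[is_skin(*px) for px in row] for row in image]


def candidate_windows(mask, win=4, step=2, min_skin_ratio=0.5):
    """Slide a win x win window and keep positions with enough skin pixels.

    Returns (kept_positions, total_number_of_windows).
    """
    h, w = len(mask), len(mask[0])
    kept, total = [], 0
    for y in range(0, h - win + 1, step):
        for x in range(0, w - win + 1, step):
            total += 1
            skin = sum(mask[yy][xx]
                       for yy in range(y, y + win)
                       for xx in range(x, x + win))
            if skin / (win * win) >= min_skin_ratio:
                kept.append((x, y))
    return kept, total


def detect_faces(image, spatial_detector, win=4, step=2):
    """Run the (expensive) spatial detector only on color-pruned windows.

    spatial_detector(image, x, y, win) -> bool is supplied by the caller.
    Returns (hits, fraction_of_windows_pruned_by_color).
    """
    kept, total = candidate_windows(skin_mask(image), win, step)
    hits = [(x, y) for (x, y) in kept if spatial_detector(image, x, y, win)]
    pruned_fraction = 1 - len(kept) / total if total else 0.0
    return hits, pruned_fraction
```

The `pruned_fraction` value corresponds to the kind of search-space reduction the abstract reports, although the figure obtained from this toy version depends entirely on the assumed thresholds and test image.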