Improvement of sector based Multiple speaker localization in a smart room

Cited by: 0
Authors
Hesam, M. [1]
Marvi, H. [1]
Affiliations
[1] Shahrood Univ Technol, Dept Elect & Robot Engn, Shahrood, Iran
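Keywords
multiperson localization; time delay of arrival (TDOA); head orientation; microphone array
DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809
Abstract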
Recent advances in computer technology and speech processing, together with growing interest in human-machine communication, have enabled the development of hands-free speech applications that use microphone arrays in smart room environments. One of the most important tasks in a smart room is multi-speaker localization, which enables a wide spectrum of applications. Source localization typically combines the hyperbolae produced by time delay estimation (TDE) between several microphone pairs. In this paper, we propose a new acoustic multi-speaker localization function, called OPROD-PHAT, that combines the TDEs by multiplying the spatial likelihood functions (SLFs) generated from each microphone pair and incorporates head orientation information. To reduce the search space, we divide the meeting room into a few sectors. For each time frame, we estimate the average output power of the OPROD-PHAT function within the volume of each sector, and a new two-step adaptive threshold determines more reliably which sectors contain an active speaker. Finally, we apply a closed-form TDOA-based localization approach within each active sector, which shows that a single-speaker TDOA method can be applied to a multi-speaker problem. Simulation results show the superior performance of the proposed system.
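The abstract gives no formulas, so the following is only a minimal sketch of the general idea it describes: multiply PHAT-weighted SLFs across microphone pairs, average the result per sector, and keep the sectors whose average power is high. Everything specific here is an assumption made for illustration: the 2-D room, the microphone positions, the sampling rate, the one-sample lag tolerance, and the fixed relative threshold, which stands in for the paper's two-step adaptive threshold. The head-orientation term of OPROD-PHAT is not reproduced, and the function names are hypothetical.

```python
import itertools
import numpy as np

C = 343.0   # speed of sound in m/s (assumed)
FS = 16000  # sampling rate in Hz (assumed)

def gcc_phat(x, y, n_fft):
    """PHAT-weighted cross-correlation; index k holds the circular lag k of x relative to y."""
    cross = np.fft.rfft(x, n_fft) * np.conj(np.fft.rfft(y, n_fft))
    cross /= np.abs(cross) + 1e-12  # keep phase only
    return np.fft.irfft(cross, n_fft)

def product_slf(frames, mics, grid, n_fft, floor=1e-6):
    """Multiply the per-pair SLFs at every candidate grid point."""
    dist = np.linalg.norm(grid[:, None, :] - mics[None, :, :], axis=2)  # (points, mics)
    slf = np.ones(len(grid))
    for i, j in itertools.combinations(range(len(mics)), 2):
        gcc = gcc_phat(frames[i], frames[j], n_fft)
        # Expected lag (in samples) of mic i relative to mic j for each grid point.
        lag = np.rint(FS * (dist[:, i] - dist[:, j]) / C).astype(int)
        # Look up the correlation with a +/- 1 sample tolerance (illustrative choice).
        idx = (lag[:, None] + np.array([-1, 0, 1])) % n_fft
        # Clip at a small positive floor so negative values cannot flip the product's sign.
        slf *= np.maximum(gcc[idx].max(axis=1), floor)
    return slf

def active_sectors(slf, sector_of_point, n_sectors, rel_threshold=0.5):
    """Average the SLF inside each sector and keep sectors near the maximum power.
    The fixed relative threshold is a placeholder, not the paper's adaptive rule."""
    power = np.array([slf[sector_of_point == s].mean() for s in range(n_sectors)])
    return np.flatnonzero(power >= rel_threshold * power.max()), power

def demo():
    """Simulate one noise source in a 4 m x 4 m room split into two sectors."""
    rng = np.random.default_rng(0)
    n_fft = 2048
    mics = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0], [4.0, 4.0]])
    axis = np.linspace(0.05, 3.95, 40)
    grid = np.array([[x, y] for x in axis for y in axis])
    source = grid[30 * 40 + 10]                          # a grid point in the right half
    sector_of_point = (grid[:, 0] >= 2.0).astype(int)    # sector 0: left, sector 1: right
    signal = rng.standard_normal(n_fft)
    delays = np.rint(FS * np.linalg.norm(mics - source, axis=1) / C).astype(int)
    frames = np.array([np.roll(signal, d) for d in delays])  # circular integer delays
    slf = product_slf(frames, mics, grid, n_fft)
    active, power = active_sectors(slf, sector_of_point, 2)
    return source, grid[np.argmax(slf)], active, power

source, estimate, active, power = demo()
print("true source:", source, "SLF peak:", estimate, "active sectors:", active)
```

In this toy setup the source is anechoic and the delays are whole samples, so the product SLF is sharply peaked. With reverberation or several simultaneous talkers, the threshold rule and the within-sector localizer would need the more careful treatment the abstract alludes to.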
Pages: 470-473
Page count: 4
Related papers
50 in total
  • [1] Automatic Speaker Localization based on Speaker Identification -A Smart Room Application-
    Ouamour, Siham
    Sayoud, Halim
    2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,
  • [2] Smart room: Participant and speaker localization and identification
    Busso, C
    Hernanz, S
    Chu, CW
    Kwon, SI
    Lee, S
    Georgiou, PG
    Cohen, I
    Narayanan, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1117 - 1120
  • [3] A speaker localization system for lecture room environment
    Parviainen, Mikko
    Pirinen, Tuomo
    Pertila, Pasi
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 225 - +
  • [4] Robust speaker localization in meeting room domain
    Pertila, P.
    Parviainen, M.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 497 - +
  • [5] A NEURAL NETWORK BASED ALGORITHM FOR SPEAKER LOCALIZATION IN A MULTI-ROOM ENVIRONMENT
    Vesperini, Fabio
    Vecchiotti, Paolo
    Principi, Emanuele
    Squartini, Stefano
    Piazza, Francesco
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [6] A PROBABILISTIC FRAMEWORK FOR MULTIPLE SPEAKER LOCALIZATION
    Oualil, Youssef
    Magimai-Doss, Mathew
    Faubel, Friedrich
    Klakow, Dietrich
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3962 - 3966
  • [7] Speaker localization using microphone array in a reverberant room
    Zou, QY
    Rahardja, S
    Cai, ZB
    2002 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I AND II, 2002, : 354 - 357
  • [8] Design of Room-Layout Estimator Using Smart Speaker
    Joya, Tomoki
    Ishida, Shigemi
    Mitsukude, Yudai
    Arakawa, Yutaka
    MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, 2022, 419 : 24 - 39
  • [9] Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions
    Abutalebi, Hamid Reza
    Momenzadeh, Hossein
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
  • [10] Improvement of Speaker Vector-Based Speaker Verification
    Tadokoro, Naoki
    Kosaka, Tetsuo
    Kato, Masaharu
    Kohda, Masaki
    FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 721 - 724