Audio-Visual Detection of Multiple Chirping Robots

被引:3
|
作者
Gribovskiy, Alexey [1 ]
Mondada, Francesco [1 ]
机构
[1] Ecole Polytech Fed Lausanne, LSRO, CH-1015 Lausanne, Switzerland
关键词
Microphone arrays; sound localization; audio-visual; multi-source; information fusion;
D O I
10.3233/978-1-58603-887-8-324
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Design, study, and control of mixed animals-robots societies are the fields of scientific exploration that can bring new opportunities for study and control of groups of social insects and animals and, in particular, for improvement of welfare and breeding conditions of domestic animals. Our long-term objective is to develop a mobile robot, socially acceptable by chickens and able to interact with them using appropriate communication channels. For interaction purposes the robot has to know positions of all birds in an experimental area and detect those uttering calls. In this paper, we present an audio-visual approach to locate the robots and animals on a scene and detect their calling activity. The visual tracking is provided by a marker-based tracker with help of an overhead camera. Sound localization is achieved by the beamforming approach using an array of sixteen microphones. Visual and sound information are probabilistically mixed to detect the calling activity. The experimental results demonstrate that our system is capable to detect the sound emission activity of multiple moving robots with 90% probability.
引用
收藏
页码:324 / 331
页数:8
相关论文
共 50 条
  • [31] AUDIO-VISUAL DEVELOPMENTS
    Schwartz, Mortimer
    JOURNAL OF LEGAL EDUCATION, 1952, 5 (01) : 88 - 95
  • [32] Detection of inconsistent audio-visual events in virtual reality
    Sorkin, A.
    Peled, A. ]
    Weinshall, D.
    PERCEPTION, 2006, 35 : 203 - 204
  • [33] Audio-Visual Classification and Detection of Human Manipulation Actions
    Pieropan, Alessandro
    Salvi, Giampiero
    Pauwels, Karl
    Kjellstrom, Hedvig
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 3045 - 3052
  • [34] AUDIO-VISUAL FOR THE PATIENT
    STUTTLE, FL
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 1959, 41 (07): : 1362 - 1362
  • [35] The Audio-Visual Reader
    不详
    JOURNAL OF EDUCATIONAL RESEARCH, 1955, 48 (07): : 552 - 553
  • [36] Temporal Feature Prediction in Audio-Visual Deepfake Detection
    Gao, Yuan
    Wang, Xuelong
    Zhang, Yu
    Zeng, Ping
    Ma, Yingjie
    ELECTRONICS, 2024, 13 (17)
  • [37] Vehicle Detection and Classification using Audio-Visual cues
    Piyush, P.
    Rajan, Rajeev
    Mary, Leena
    Koshy, Bino I.
    2016 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2016, : 732 - 736
  • [38] Audio-visual synchrony detection under scotopic conditions
    Cass, J.
    Churruca, K.
    Van der Burg, E.
    Alais, D.
    PERCEPTION, 2012, 41 : 225 - 225
  • [39] Audio-Visual Prosody: Perception, Detection, and Synthesis of Prominence
    Al Moubayed, Samer
    Beskow, Jonas
    Granstrom, Bjorn
    House, David
    TOWARD AUTONOMOUS, ADAPTIVE, AND CONTEXT-AWARE MULTIMODAL INTERFACES: THEORETICAL AND PRACTICAL ISSUES, 2011, 6456 : 55 - 71
  • [40] VOICE ACTIVITY DETECTION USING AUDIO-VISUAL INFORMATION
    Petsatodis, Theodoros
    Pnevmatikakis, Aristodemos
    Boukis, Christos
    2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 216 - +