Audio-Visual Detection of Multiple Chirping Robots

被引:3
|
作者
Gribovskiy, Alexey [1 ]
Mondada, Francesco [1 ]
机构
[1] Ecole Polytech Fed Lausanne, LSRO, CH-1015 Lausanne, Switzerland
关键词
Microphone arrays; sound localization; audio-visual; multi-source; information fusion;
D O I
10.3233/978-1-58603-887-8-324
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Design, study, and control of mixed animals-robots societies are the fields of scientific exploration that can bring new opportunities for study and control of groups of social insects and animals and, in particular, for improvement of welfare and breeding conditions of domestic animals. Our long-term objective is to develop a mobile robot, socially acceptable by chickens and able to interact with them using appropriate communication channels. For interaction purposes the robot has to know positions of all birds in an experimental area and detect those uttering calls. In this paper, we present an audio-visual approach to locate the robots and animals on a scene and detect their calling activity. The visual tracking is provided by a marker-based tracker with help of an overhead camera. Sound localization is achieved by the beamforming approach using an array of sixteen microphones. Visual and sound information are probabilistically mixed to detect the calling activity. The experimental results demonstrate that our system is capable to detect the sound emission activity of multiple moving robots with 90% probability.
引用
收藏
页码:324 / 331
页数:8
相关论文
共 50 条
  • [21] Audio-Visual Objects
    Kubovy M.
    Schutz M.
    Review of Philosophy and Psychology, 2010, 1 (1) : 41 - 61
  • [22] Audio-Visual Segmentation
    Zhou, Jinxing
    Wang, Jianyuan
    Zhang, Jiayi
    Sun, Weixuan
    Zhang, Jing
    Birchfield, Stan
    Guo, Dan
    Kong, Lingpeng
    Wang, Meng
    Zhong, Yiran
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 386 - 403
  • [23] USING MULTIPLE VISUAL TANDEM STREAMS IN AUDIO-VISUAL SPEECH RECOGNITION
    Topkaya, Ibrahim Saygin
    Erdogan, Hakan
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4988 - 4991
  • [24] AUDIO-VISUAL TECHNOLOGIES
    TAKESHITA, M
    FURUKAWA, M
    HAYATSU, R
    MURAKAMI, R
    SUZUKI, K
    HASHIZUME, K
    NEC RESEARCH & DEVELOPMENT, 1990, (96): : 265 - 277
  • [25] AUDIO-VISUAL POTPOURRI
    不详
    INDUSTRIAL PHOTOGRAPHY, 1968, 17 (07): : 30 - &
  • [26] Audio-Visual Techniques
    Sears, William P., Jr.
    EDUCATION, 1948, 69 (02): : 132 - 132
  • [27] AUDIO-VISUAL UNIT
    WHARTON, BA
    PEDIATRICS, 1971, 47 (05) : 957 - &
  • [28] Audio-visual imposture
    Karam, Walid
    Mokbel, Chafic
    Greige, Hanna
    Chollet, Gerard
    MOBILE MULTIMEDIA/IMAGE PROCESSING FOR MILITARY AND SECURITY APPLICATIONS, 2006, 6250
  • [29] AUDIO-VISUAL CLINICS
    GRABER, TM
    HANNETT, HA
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 1963, 49 (07) : 538 - &
  • [30] Audio-visual biometrics
    Aleksic, Petar S.
    Katsaggelos, Aggelos K.
    PROCEEDINGS OF THE IEEE, 2006, 94 (11) : 2025 - 2044