Audio-Visual Detection of Multiple Chirping Robots

被引：3

作者：

Gribovskiy, Alexey ^{[1
]}

Mondada, Francesco ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne, LSRO, CH-1015 Lausanne, Switzerland

来源：

IAS-10: INTELLIGENT AUTONOMOUS SYSTEMS 10 | 2008年

关键词：

Microphone arrays; sound localization; audio-visual; multi-source; information fusion;

D O I：

10.3233/978-1-58603-887-8-324

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Design, study, and control of mixed animals-robots societies are the fields of scientific exploration that can bring new opportunities for study and control of groups of social insects and animals and, in particular, for improvement of welfare and breeding conditions of domestic animals. Our long-term objective is to develop a mobile robot, socially acceptable by chickens and able to interact with them using appropriate communication channels. For interaction purposes the robot has to know positions of all birds in an experimental area and detect those uttering calls. In this paper, we present an audio-visual approach to locate the robots and animals on a scene and detect their calling activity. The visual tracking is provided by a marker-based tracker with help of an overhead camera. Sound localization is achieved by the beamforming approach using an array of sixteen microphones. Visual and sound information are probabilistically mixed to detect the calling activity. The experimental results demonstrate that our system is capable to detect the sound emission activity of multiple moving robots with 90% probability.

引用

页码：324 / 331

页数：8

共 50 条

[21] Audio-Visual Objects
Kubovy M.
Schutz M.
Review of Philosophy and Psychology, 2010, 1 (1) : 41 - 61
[22] Audio-Visual Segmentation
Zhou, Jinxing
Wang, Jianyuan
Zhang, Jiayi
Sun, Weixuan
Zhang, Jing
Birchfield, Stan
Guo, Dan
Kong, Lingpeng
Wang, Meng
Zhong, Yiran
COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 386 - 403
[23] USING MULTIPLE VISUAL TANDEM STREAMS IN AUDIO-VISUAL SPEECH RECOGNITION
Topkaya, Ibrahim Saygin
Erdogan, Hakan
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4988 - 4991
[24] AUDIO-VISUAL TECHNOLOGIES
TAKESHITA, M
FURUKAWA, M
HAYATSU, R
MURAKAMI, R
SUZUKI, K
HASHIZUME, K
NEC RESEARCH & DEVELOPMENT, 1990, (96): : 265 - 277
[25] AUDIO-VISUAL POTPOURRI
不详
INDUSTRIAL PHOTOGRAPHY, 1968, 17 (07): : 30 - &
[26] Audio-Visual Techniques
Sears, William P., Jr.
EDUCATION, 1948, 69 (02): : 132 - 132
[27] AUDIO-VISUAL UNIT
WHARTON, BA
PEDIATRICS, 1971, 47 (05) : 957 - &
[28] Audio-visual imposture
Karam, Walid
Mokbel, Chafic
Greige, Hanna
Chollet, Gerard
MOBILE MULTIMEDIA/IMAGE PROCESSING FOR MILITARY AND SECURITY APPLICATIONS, 2006, 6250
[29] AUDIO-VISUAL CLINICS
GRABER, TM
HANNETT, HA
AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 1963, 49 (07) : 538 - &
[30] Audio-visual biometrics
Aleksic, Petar S.
Katsaggelos, Aggelos K.
PROCEEDINGS OF THE IEEE, 2006, 94 (11) : 2025 - 2044

← 1 2 3 4 5 →