Automatic Segmentation of Audio Signals for Bird Species Identification

被引：9

作者：

Evangelista, Thiago L. F. ^{[1
]}

Priolli, Thales M. ^{[1
]}

Silla, Carlos N., Jr. ^{[1
]}

Angelico, Bruno A. ^{[1
]}

Kaestner, Celso A. A. ^{[2
]}

机构：

[1] Univ Tecnol Fed Parana, Ave Alberto Carazzai 1640, BR-86300000 Cornelio Procopio, Parana, Brazil

[2] Univ Tecnol Fed Parana, BR-80230901 Curitiba, Parana, Brazil

来源：

2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM) | 2014年

关键词：

Processing; Pattern Recognition; Machine Learning; Bird Species Identification; SOUNDS; CLASSIFICATION;

D O I：

10.1109/ISM.2014.46

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The identification of bird species from their audio recorded songs are nowadays used in several important applications, such as to monitor the quality of the environment and to prevent bird-plane collisions near airports. The complete identification cycle involves the use of: (a) recording devices to acquire the songs, (b) audio processing techniques to remove the noise and to select the most representative elements of the signal, (c) feature extraction procedures to obtain relevant characteristics, and (d) decision procedures to make the identification. The decision procedures can be obtained by Machine Learning (ML) algorithms, considering the problem in a standard classification scenario. One key element is this cycle is the selection of the most relevant segments of the audio for identification purposes. In this paper we show that the use of short audio segments with high amplitude - called pulses in our work - outperforms the use of the complete audio records in the species identification task. We also show how these pulses can be automatically obtained, based on measurements performed directly on the audio signal. The employed classifiers are trained using a previously labeled database of bird songs. We use a database that contains bird song recordings from 75 species which appear in the Southern Atlantic Coast of South America. Obtained results show that the use of automatically obtained pulses and a SVM classifier produce the best results; all the necessary procedures can be installed in a dedicated hardware, allowing the construction of a specific bird identification device.

引用

页码：223 / 228

页数：6

共 50 条

[41] Audio hashing technique for automatic song identification
Mapelli, F
Lancini, R
ITRE2003: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, 2003, : 84 - 88
[42] Automatic Detection of Bird Species from Audio Field Recordings using HMM-based Modelling of Frequency Tracks
Jancovic, Peter
Kokuer, Munevver
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1779 - 1783
[43] Hierarchical Classification of Bird Species Using Their Audio Recorded Songs
Silla, Carlos N., Jr.
Kaestner, Celso A. A.
2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1895 - 1900
[44] CONV-CODES: AUDIO HASHING FOR BIRD SPECIES CLASSIFICATION
Thakur, Anshul
Sharma, Pulkit
Abrol, Vinayak
Rajan, Padmanabhan
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8241 - 8245
[45] Automatic segmentation of news items based on video and audio features
Weiqiang Wang
Wen Gao
Journal of Computer Science and Technology, 2002, 17 : 189 - 195
[46] Automatic segmentation of news items based on video and audio features
Wang, WQ
Gao, W
ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 498 - 505
[47] Automatic segmentation of news items based on video and audio features
Wang, WQ
Gao, W
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (02) : 189 - 195
[48] Audio parameterization with robust frame selection for improved bird identification
Ventura, Thiago M.
de Oliveira, Allan G.
Ganchev, Todor D.
de Figueiredo, Josiel M.
Jahn, Olaf
Marques, Marinez I.
Schuchmann, Karl-L.
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8463 - 8471
[49] VISUAL AND ACOUSTIC IDENTIFICATION OF BIRD SPECIES
Marini, A.
Turatti, A. J.
Britto, A. S., Jr.
Koerich, A. L.
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2309 - 2313
[50] Probabilistic approach to automatic music transcription from audio signals
Miyamoto, Kenichi
Kameoka, Hirokazu
Takeda, Haruto
Nishimoto, Takuya
Sagayama, Shigeki
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 697 - +

← 1 2 3 4 5 →