Automatic Segmentation of Audio Signals for Bird Species Identification

Cited by: 9

Authors:

Evangelista, Thiago L. F. [1]

Priolli, Thales M. [1]

Silla, Carlos N., Jr. [1]

Angelico, Bruno A. [1]

Kaestner, Celso A. A. [2]

Affiliations:

[1] Univ Tecnol Fed Parana, Ave Alberto Carazzai 1640, BR-86300000 Cornelio Procopio, Parana, Brazil

[2] Univ Tecnol Fed Parana, BR-80230901 Curitiba, Parana, Brazil

Source:

2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM) | 2014

Keywords:

Processing; Pattern Recognition; Machine Learning; Bird Species Identification; SOUNDS; CLASSIFICATION;

DOI:

10.1109/ISM.2014.46

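Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Subject classification codes: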

0808 ; 0809 ;

Abstract:

The identification of bird species from audio recordings of their songs is nowadays used in several important applications, such as monitoring the quality of the environment and preventing bird-plane collisions near airports. The complete identification cycle involves the use of: (a) recording devices to acquire the songs, (b) audio processing techniques to remove the noise and to select the most representative elements of the signal, (c) feature extraction procedures to obtain relevant characteristics, and (d) decision procedures to make the identification. The decision procedures can be obtained by Machine Learning (ML) algorithms, treating the problem as a standard classification scenario. One key element in this cycle is the selection of the most relevant segments of the audio for identification purposes. In this paper we show that the use of short audio segments with high amplitude (called pulses in our work) outperforms the use of the complete audio records in the species identification task. We also show how these pulses can be automatically obtained, based on measurements performed directly on the audio signal. The employed classifiers are trained using a previously labeled database of bird songs. We use a database that contains bird song recordings from 75 species which appear in the Southern Atlantic Coast of South America. The results show that the combination of automatically obtained pulses and an SVM classifier produces the best results; all the necessary procedures can be installed on dedicated hardware, allowing the construction of a specific bird identification device.
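The abstract does not spell out how the pulses are located, so the following is only an illustrative sketch of amplitude-based segmentation, not the authors' algorithm. It assumes non-overlapping frames, a per-frame peak amplitude, and a threshold set relative to the loudest frame; the names `find_pulses`, `frame_len`, and `rel_threshold` are hypothetical, as are the synthetic test tones.

```python
import math


def find_pulses(samples, frame_len=256, rel_threshold=0.5):
    """Return (start, end) sample ranges of high-amplitude segments.

    Assumption (not from the paper): a frame belongs to a pulse when its
    peak absolute amplitude reaches rel_threshold times the largest
    frame peak in the recording.
    """
    if not samples:
        return []
    # Peak absolute amplitude of each non-overlapping frame.
    peaks = [
        max(abs(x) for x in samples[i:i + frame_len])
        for i in range(0, len(samples), frame_len)
    ]
    threshold = rel_threshold * max(peaks)
    pulses = []
    start = None
    for idx, peak in enumerate(peaks):
        # An all-zero recording has no pulses, hence the peak > 0 guard.
        loud = peak >= threshold and peak > 0
        if loud and start is None:
            start = idx * frame_len            # pulse begins at this frame
        elif not loud and start is not None:
            pulses.append((start, idx * frame_len))  # pulse ended
            start = None
    if start is not None:                      # pulse runs to the end
        pulses.append((start, len(samples)))
    return pulses


# Synthetic signal: two loud bursts separated by quiet background.
rate = 8000
quiet = [0.01 * math.sin(2 * math.pi * 440 * n / rate) for n in range(1024)]
loud = [0.9 * math.sin(2 * math.pi * 2000 * n / rate) for n in range(512)]
signal = quiet + loud + quiet + loud + quiet

pulses = find_pulses(signal, frame_len=256, rel_threshold=0.5)
print(pulses)  # [(1024, 1536), (2560, 3072)]
```

In a pipeline like the one the abstract outlines, features would then be extracted from each returned range and passed to a classifier such as an SVM; the frame length and threshold here are arbitrary choices that would need tuning on real recordings.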

Pages: 223-228

Page count: 6

