Automatic Segmentation of Audio Signals for Bird Species Identification

被引:9
|
作者
Evangelista, Thiago L. F. [1 ]
Priolli, Thales M. [1 ]
Silla, Carlos N., Jr. [1 ]
Angelico, Bruno A. [1 ]
Kaestner, Celso A. A. [2 ]
机构
[1] Univ Tecnol Fed Parana, Ave Alberto Carazzai 1640, BR-86300000 Cornelio Procopio, Parana, Brazil
[2] Univ Tecnol Fed Parana, BR-80230901 Curitiba, Parana, Brazil
来源
2014 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM) | 2014年
关键词
Processing; Pattern Recognition; Machine Learning; Bird Species Identification; SOUNDS; CLASSIFICATION;
D O I
10.1109/ISM.2014.46
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The identification of bird species from their audio recorded songs are nowadays used in several important applications, such as to monitor the quality of the environment and to prevent bird-plane collisions near airports. The complete identification cycle involves the use of: (a) recording devices to acquire the songs, (b) audio processing techniques to remove the noise and to select the most representative elements of the signal, (c) feature extraction procedures to obtain relevant characteristics, and (d) decision procedures to make the identification. The decision procedures can be obtained by Machine Learning (ML) algorithms, considering the problem in a standard classification scenario. One key element is this cycle is the selection of the most relevant segments of the audio for identification purposes. In this paper we show that the use of short audio segments with high amplitude - called pulses in our work - outperforms the use of the complete audio records in the species identification task. We also show how these pulses can be automatically obtained, based on measurements performed directly on the audio signal. The employed classifiers are trained using a previously labeled database of bird songs. We use a database that contains bird song recordings from 75 species which appear in the Southern Atlantic Coast of South America. Obtained results show that the use of automatically obtained pulses and a SVM classifier produce the best results; all the necessary procedures can be installed in a dedicated hardware, allowing the construction of a specific bird identification device.
引用
收藏
页码:223 / 228
页数:6
相关论文
共 50 条
  • [31] Identifying Colombian Bird Species from Audio Recordings
    Reyes, Angie K.
    Caicedo, Juan C.
    Camargo, Jorge E.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2016, 2017, 10125 : 274 - 281
  • [32] Automatic Audio Segmentation Using the Generalized Likelihood Ratio
    Wang, D.
    Vogt, R.
    Mason, M.
    Sridharan, S.
    ICSPCS: 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, PROCEEDINGS, 2008, : 341 - 345
  • [33] Automatic segmentation and clustering for speaker indexing of audio databases
    Chen, YX
    Gao, J
    Wang, Q
    PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005, : 399 - 403
  • [34] Deep Learning-based Automatic Bird Species Identification from Isolated Recordings
    Noumida, A.
    Rajan, Rajeev
    2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 252 - 256
  • [35] A Method for Automatic Segmentation and Parameter Estimation of Bird Vocalizations
    Barmatz, Hagai
    Klein, Dana
    Vortman, Yoni
    Toledo, Sivan
    Lavner, Yizhar
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP 2019), 2019, : 211 - 216
  • [36] Automatic Detection and Removal of Impulsive Noise in Audio Signals
    Oudre, Laurent
    IMAGE PROCESSING ON LINE, 2015, 5 : 267 - 281
  • [37] Automatic mood detection and tracking of music audio signals
    Lu, L
    Liu, D
    Zhang, HJ
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 5 - 18
  • [38] Factors in automatic musical genre classification of audio signals
    Li, T
    Tzanetakis, G
    2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, : 143 - 146
  • [39] A New Automatic Method For Seismic Signals Segmentation
    Pikoulis, Erion-Vasilis
    Psarakis, Emmanouil Z.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3973 - 3976
  • [40] An algorithm for the automatic segmentation of acoustic emission signals
    Khalfallah, R
    Simard, P
    TRENDS IN NDE SCIENCE AND TECHNOLOGY - PROCEEDINGS OF THE 14TH WORLD CONFERENCE ON NDT (14TH WCNDT), VOLS 1-5, 1996, : 2555 - 2558