Automatic Segmentation of Audio Signals for Bird Species Identification

被引:9
|
作者
Evangelista, Thiago L. F. [1 ]
Priolli, Thales M. [1 ]
Silla, Carlos N., Jr. [1 ]
Angelico, Bruno A. [1 ]
Kaestner, Celso A. A. [2 ]
机构
[1] Univ Tecnol Fed Parana, Ave Alberto Carazzai 1640, BR-86300000 Cornelio Procopio, Parana, Brazil
[2] Univ Tecnol Fed Parana, BR-80230901 Curitiba, Parana, Brazil
关键词
Processing; Pattern Recognition; Machine Learning; Bird Species Identification; SOUNDS; CLASSIFICATION;
D O I
10.1109/ISM.2014.46
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The identification of bird species from their audio recorded songs are nowadays used in several important applications, such as to monitor the quality of the environment and to prevent bird-plane collisions near airports. The complete identification cycle involves the use of: (a) recording devices to acquire the songs, (b) audio processing techniques to remove the noise and to select the most representative elements of the signal, (c) feature extraction procedures to obtain relevant characteristics, and (d) decision procedures to make the identification. The decision procedures can be obtained by Machine Learning (ML) algorithms, considering the problem in a standard classification scenario. One key element is this cycle is the selection of the most relevant segments of the audio for identification purposes. In this paper we show that the use of short audio segments with high amplitude - called pulses in our work - outperforms the use of the complete audio records in the species identification task. We also show how these pulses can be automatically obtained, based on measurements performed directly on the audio signal. The employed classifiers are trained using a previously labeled database of bird songs. We use a database that contains bird song recordings from 75 species which appear in the Southern Atlantic Coast of South America. Obtained results show that the use of automatically obtained pulses and a SVM classifier produce the best results; all the necessary procedures can be installed in a dedicated hardware, allowing the construction of a specific bird identification device.
引用
收藏
页码:223 / 228
页数:6
相关论文
共 50 条
  • [41] Audio hashing technique for automatic song identification
    Mapelli, F
    Lancini, R
    ITRE2003: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, 2003, : 84 - 88
  • [42] Automatic Detection of Bird Species from Audio Field Recordings using HMM-based Modelling of Frequency Tracks
    Jancovic, Peter
    Kokuer, Munevver
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1779 - 1783
  • [43] Hierarchical Classification of Bird Species Using Their Audio Recorded Songs
    Silla, Carlos N., Jr.
    Kaestner, Celso A. A.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1895 - 1900
  • [44] CONV-CODES: AUDIO HASHING FOR BIRD SPECIES CLASSIFICATION
    Thakur, Anshul
    Sharma, Pulkit
    Abrol, Vinayak
    Rajan, Padmanabhan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8241 - 8245
  • [45] Automatic segmentation of news items based on video and audio features
    Weiqiang Wang
    Wen Gao
    Journal of Computer Science and Technology, 2002, 17 : 189 - 195
  • [46] Automatic segmentation of news items based on video and audio features
    Wang, WQ
    Gao, W
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 498 - 505
  • [47] Automatic segmentation of news items based on video and audio features
    Wang, WQ
    Gao, W
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (02) : 189 - 195
  • [48] Audio parameterization with robust frame selection for improved bird identification
    Ventura, Thiago M.
    de Oliveira, Allan G.
    Ganchev, Todor D.
    de Figueiredo, Josiel M.
    Jahn, Olaf
    Marques, Marinez I.
    Schuchmann, Karl-L.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8463 - 8471
  • [49] VISUAL AND ACOUSTIC IDENTIFICATION OF BIRD SPECIES
    Marini, A.
    Turatti, A. J.
    Britto, A. S., Jr.
    Koerich, A. L.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2309 - 2313
  • [50] Probabilistic approach to automatic music transcription from audio signals
    Miyamoto, Kenichi
    Kameoka, Hirokazu
    Takeda, Haruto
    Nishimoto, Takuya
    Sagayama, Shigeki
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 697 - +