Adaptive energy detection for bird sound detection in complex environments

被引:25
|
作者
Zhang, Xiaoxia [1 ,2 ]
Li, Ying [2 ]
机构
[1] Fujian Med Univ, Informat Ctr, Fuzhou 350108, Peoples R China
[2] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350108, Peoples R China
关键词
Adaptive energy detection; Noise variance estimation; Mel-scaled Wavelet packet decomposition; Sub-band Cepstral Coefficient (MWSCC); Mel-Frequency Cepstral Coefficient (MFCC); Bird sound recognition; SUPPORT VECTOR MACHINES; PARAMETRIC REPRESENTATIONS; RECOGNITION; SPEAKER;
D O I
10.1016/j.neucom.2014.12.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new bird sound classification approach based on adaptive energy detection was proposed to improve the recognition accuracy of bird sounds in noisy environments. In this paper, the bird sounds with background noises were divided into three linear frequency bands according to their frequency distribution in spectrogram. The noise spectrum of each band was estimated and the existent probability of the foreground bird sound for each band was computed to serve for the adaptive threshold of energy detection. These foreground bird sound signals were detected and selected via adaptive energy detection from the bird sounds with background noises. Then, the features of Mel-scaled Wavelet packet decomposition Sub-band Cepstral Coefficient (MWSCC) and Mel-Frequency Cepstral Coefficient (MFCC) were extracted from the above signals for classification by using the classifier of Support Vector Machine (SVM), respectively. Moreover, the differences of recognition performance were implemented on 30 kinds of bird sounds at different Signal-to-Noise Ratios (SNRs) under different noisy environments, before or after adaptive energy detection. The results show that MWSCC has better noise immunity function, and the recognition performance after adaptive energy detection improves more significantly, indicating that it is a very suitable approach for the bird sound recognition in complex environments. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:108 / 116
页数:9
相关论文
共 50 条
  • [31] In situ detection of the protein corona in complex environments
    Carril, Monica
    Padro, Daniel
    del Pino, Pablo
    Carrillo-Carrion, Carolina
    Gallego, Marta
    Parak, Wolfgang J.
    NATURE COMMUNICATIONS, 2017, 8
  • [32] Dense pedestrian face detection in complex environments
    Gao, Qiang
    Ding, Bingru
    Jia, Xu
    Xie, Yinghong
    Han, Xiaowei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [33] Surface Target Saliency Detection in Complex Environments
    Yang, Benxin
    Chen, Yaojie
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 664 - 675
  • [34] Verbal aggression detection in complex social environments
    van Hengel, P. W. J.
    Andringa, T. C.
    2007 IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, 2007, : 15 - +
  • [35] Detection of nanoparticle aggregation in complex, biological environments
    Jenkins, Samir V.
    Chen, Jingyi
    Zhang, Yongbin
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 247
  • [36] Surface Target Saliency Detection in Complex Environments
    Yang, Benxin
    Chen, Yaojie
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2023, 14087 LNCS : 664 - 675
  • [37] Adaptive real-time road detection using VRay and A-MSRG in complex environments
    Weon, SunHee
    Joo, SungIl
    Choi, HyungIl
    VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS, 2013, 8663
  • [38] Sound Event Detection for Human Safety and Security in Noisy Environments
    Neri, Michael
    Battisti, Federica
    Neri, Alessandro
    Carli, Marco
    IEEE ACCESS, 2022, 10 : 134230 - 134240
  • [39] Semantic Gap Detection in Metadata of Adaptive Learning Environments
    Sosnovsky, Sergey
    Alpizar Chacon, Isaac
    2014 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT), 2014, : 548 - +
  • [40] Adaptive Strategies for Target Detection and Localization in Noisy Environments
    Iwen, Mark A.
    Tewfik, Ahmed H.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (05) : 2344 - 2353