BirdVoxDetect: Large-Scale Detection and Classification of Flight Calls for Bird Migration Monitoring

被引:1
|
作者
Lostanlen, Vincent [1 ]
Cramer, Aurora [2 ]
Salamon, Justin [3 ]
Farnsworth, Andrew [4 ]
Van Doren, Benjamin M. [4 ]
Kelling, Steve [4 ]
Bello, Juan Pablo [2 ]
机构
[1] Ctr Natl Rech Sci CNRS, Lab Sci Numer Nantes LS2N, F-44300 Nantes, France
[2] NYU, New York, NY 10012 USA
[3] Adobe Res, San Francisco, CA 94107 USA
[4] Cornell Univ, Cornell Lab Ornithol, Ithaca, NY 14850 USA
关键词
Birds; Monitoring; Recording; Machine learning; Background noise; Speech processing; Training; Acoustic signal detection; audio databases; deep learning; ecosystems; phylogeny; WEATHER;
D O I
10.1109/TASLP.2024.3444486
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sound event classification has the potential to advance our understanding of bird migration. Although it is long known that migratory species have a vocal signature of their own, previous work on automatic flight call classification has been limited in robustness and scope: e.g., covering few recording sites, short acquisition segments, and simplified biological taxonomies. In this paper, we present BirdVoxDetect (BVD), the first full-fledged solution to bird migration monitoring from acoustic sensor network data. As an open-source software, BVD integrates an original pipeline of three machine learning modules. The first module is a random forest classifier of sensor faults, trained with human-in-the-loop active learning. The second module is a deep convolutional neural network for sound event detection with per-channel energy normalization (PCEN). The third module is a multitask convolutional neural network which predicts the family, genus, and species of flight calls from passerines (Passeriformes) of North America. We evaluate BVD on a new dataset (296 hours from nine locations, the largest to date for this task) and discuss the main sources of estimation error in a real-world deployment: mechanical sensor failures, sensitivity to background noise, misdetection, and taxonomic confusion. Then, we deploy BVD to an unprecedented scale: 6672 hours of audio (approximately one terabyte), corresponding to a full season of bird migration. Running BVD in parallel over the full-season dataset yields 1.6 billion FFT's, 480 million neural network predictions, and over six petabytes of throughput. With this method, our main finding is that deep learning and bioacoustic sensor networks are ready to complement radar observations and crowdsourced surveys for bird migration monitoring, thus benefiting conservation ecology and land-use planning at large.
引用
收藏
页码:4134 / 4145
页数:12
相关论文
共 50 条
  • [21] Large-scale detection of repetitions
    Smyth, W. F.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2014, 372 (2016):
  • [22] Large-scale Packet Classification on FPGA
    Zhou, Shijie
    Qu, Yun R.
    Prasanna, Viktor K.
    PROCEEDINGS OF THE ASAP2015 2015 IEEE 26TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2015, : 226 - 233
  • [23] Consequences of large-scale processes for the conservation of bird populations
    Baillie, SR
    Sutherland, WJ
    Freeman, SN
    Gregory, RD
    Paradis, E
    JOURNAL OF APPLIED ECOLOGY, 2000, 37 : 88 - 102
  • [24] Large-Scale Robust Semisupervised Classification
    Zhang, Lingling
    Luo, Minnan
    Li, Zhihui
    Nie, Feiping
    Zhang, Huaxiang
    Liu, Jun
    Zheng, Qinghua
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (03) : 907 - 917
  • [25] A large-scale classification of English verbs
    Karin Kipper
    Anna Korhonen
    Neville Ryant
    Martha Palmer
    Language Resources and Evaluation, 2008, 42 : 21 - 40
  • [26] Hierarchical Classification for Large-Scale Learning
    Wang, Boshi
    Barbu, Adrian
    ELECTRONICS, 2023, 12 (22)
  • [27] A large-scale classification of English verbs
    Kipper, Karin
    Korhonen, Anna
    Ryant, Neville
    Palmer, Martha
    LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (01) : 21 - 40
  • [28] LARGE-SCALE EFFECTS ON BIRD ASSEMBLAGES IN DESERT GRASSLANDS
    Block, Giselle
    Morrison, Michael L.
    WESTERN NORTH AMERICAN NATURALIST, 2010, 70 (01) : 19 - 25
  • [29] Problems in Large-Scale Image Classification
    Guo, Yuchen
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5038 - 5039
  • [30] The Survey of Large-scale Query Classification
    Zhou, Sanduo
    Cheng, Kefei
    Men, Lijun
    2017 5TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION (CDMMS 2017), 2017, 1834