BirdVoxDetect: Large-Scale Detection and Classification of Flight Calls for Bird Migration Monitoring

被引:1
|
作者
Lostanlen, Vincent [1 ]
Cramer, Aurora [2 ]
Salamon, Justin [3 ]
Farnsworth, Andrew [4 ]
Van Doren, Benjamin M. [4 ]
Kelling, Steve [4 ]
Bello, Juan Pablo [2 ]
机构
[1] Ctr Natl Rech Sci CNRS, Lab Sci Numer Nantes LS2N, F-44300 Nantes, France
[2] NYU, New York, NY 10012 USA
[3] Adobe Res, San Francisco, CA 94107 USA
[4] Cornell Univ, Cornell Lab Ornithol, Ithaca, NY 14850 USA
关键词
Birds; Monitoring; Recording; Machine learning; Background noise; Speech processing; Training; Acoustic signal detection; audio databases; deep learning; ecosystems; phylogeny; WEATHER;
D O I
10.1109/TASLP.2024.3444486
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sound event classification has the potential to advance our understanding of bird migration. Although it is long known that migratory species have a vocal signature of their own, previous work on automatic flight call classification has been limited in robustness and scope: e.g., covering few recording sites, short acquisition segments, and simplified biological taxonomies. In this paper, we present BirdVoxDetect (BVD), the first full-fledged solution to bird migration monitoring from acoustic sensor network data. As an open-source software, BVD integrates an original pipeline of three machine learning modules. The first module is a random forest classifier of sensor faults, trained with human-in-the-loop active learning. The second module is a deep convolutional neural network for sound event detection with per-channel energy normalization (PCEN). The third module is a multitask convolutional neural network which predicts the family, genus, and species of flight calls from passerines (Passeriformes) of North America. We evaluate BVD on a new dataset (296 hours from nine locations, the largest to date for this task) and discuss the main sources of estimation error in a real-world deployment: mechanical sensor failures, sensitivity to background noise, misdetection, and taxonomic confusion. Then, we deploy BVD to an unprecedented scale: 6672 hours of audio (approximately one terabyte), corresponding to a full season of bird migration. Running BVD in parallel over the full-season dataset yields 1.6 billion FFT's, 480 million neural network predictions, and over six petabytes of throughput. With this method, our main finding is that deep learning and bioacoustic sensor networks are ready to complement radar observations and crowdsourced surveys for bird migration monitoring, thus benefiting conservation ecology and land-use planning at large.
引用
收藏
页码:4134 / 4145
页数:12
相关论文
共 50 条
  • [31] Large-scale monitoring in environmental geochemistry
    Selinus, O
    APPLIED GEOCHEMISTRY, 1996, 11 (1-2) : 251 - 260
  • [32] Large-scale structural monitoring systems
    Solomon, I
    Cunnane, J
    Stevenson, P
    NONDESTRUCTIVE EVALUATION OF HIGHWAYS, UTILITIES, AND PIPELINES IV, 2000, 3995 : 276 - 287
  • [33] Large-Scale Evidence for the Effectiveness of Partisan GOTV Robo Calls
    Kling, Daniel T.
    Stratmann, Thomas
    JOURNAL OF EXPERIMENTAL POLITICAL SCIENCE, 2023, 10 (02) : 188 - 200
  • [34] Large-scale video monitoring system
    Kobayashi, Kazuaki
    NEC Technical Journal, 2010, 5 (03): : 39 - 42
  • [35] Air monitoring in large-scale operations
    Technische Uberwachung, 2008, 49 (10):
  • [36] Large-scale biological monitoring in Japan
    Ogata, M
    Numano, T
    Hosokawa, M
    Michitsuji, H
    SCIENCE OF THE TOTAL ENVIRONMENT, 1997, 199 (1-2) : 197 - 204
  • [37] Do observer differences in bird detection affect inferences from large-scale ecological studies?
    Lindenmayer, David B.
    Wood, Jeff T.
    MacGregor, Christopher
    EMU-AUSTRAL ORNITHOLOGY, 2009, 109 (02): : 100 - 106
  • [38] Combined Multiclass Classification and Anomaly Detection for Large-Scale Wireless Sensor Networks
    Shilton, Alistair
    Rajasegarar, Sutharshan
    Palaniswami, Marimuthu
    2013 IEEE EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SENSORS, SENSOR NETWORKS AND INFORMATION PROCESSING, 2013, : 491 - 496
  • [39] Fake News Detection in Large-Scale Social Network with Generalized Bayesian Classification
    Zhang, Wei
    Alzahrani, Ahmed Ibrahim
    Lee, Mi Young
    MOBILE NETWORKS & APPLICATIONS, 2024,
  • [40] UPAD: A Large-Scale Passive Sonar Benchmark Dataset for Vessel Detection and Classification
    Fischer, John
    Orescanin, Marko
    OCEANS 2024 - SINGAPORE, 2024,