Bird song comparison using deep learning trained from avian perceptual judgments

被引:0
|
作者
Zandberg, Lies [1 ,2 ]
Morfi, Veronica [3 ]
George, Julia M. [2 ,4 ]
Clayton, David F. [2 ,5 ]
Stowell, Dan [3 ,6 ,7 ]
Lachlan, Robert F. [1 ,2 ]
机构
[1] Royal Holloway Univ London, Dept Psychol, London, England
[2] Queen Mary Univ London, Dept Psychol, London, England
[3] Queen Mary Univ London, Ctr Digital Mus C4DM, Machine Listening Lab, London, England
[4] Clemson Univ, Dept Biol Sci, Clemson, SC USA
[5] Clemson Univ, Dept Genet & Biochem, Clemson, SC USA
[6] Tilburg Univ, Dept Cognit Sci & AI, Tilburg, Netherlands
[7] Nat Biodivers Ctr, Leiden, Netherlands
基金
英国生物技术与生命科学研究理事会;
关键词
SWAMP SPARROW; DISCRIMINATION; CATEGORIZATION; MECHANISMS;
D O I
10.1371/journal.pcbi.1012329
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Our understanding of bird song, a model system for animal communication and the neurobiology of learning, depends critically on making reliable, validated comparisons between the complex multidimensional syllables that are used in songs. However, most assessments of song similarity are based on human inspection of spectrograms, or computational methods developed from human intuitions. Using a novel automated operant conditioning system, we collected a large corpus of zebra finches' (Taeniopygia guttata) decisions about song syllable similarity. We use this dataset to compare and externally validate similarity algorithms in widely-used publicly available software (Raven, Sound Analysis Pro, Luscinia). Although these methods all perform better than chance, they do not closely emulate the avian assessments. We then introduce a novel deep learning method that can produce perceptual similarity judgements trained on such avian decisions. We find that this new method outperforms the established methods in accuracy and more closely approaches the avian assessments. Inconsistent (hence ambiguous) decisions are a common occurrence in animal behavioural data; we show that a modification of the deep learning training that accommodates these leads to the strongest performance. We argue this approach is the best way to validate methods to compare song similarity, that our dataset can be used to validate novel methods, and that the general approach can easily be extended to other species. How do birds hear the differences between their songs? This fascinating question carries implications, since the study of bird song, a model system for the neurobiology of learning and animal communication, depends critically on our ability to assess the similarity of songs. Traditionally, researchers compare sounds by human assessment, or use computational methods based on human intuitions about similarity. However, neither approach is connected to birds' own perception of sound similarity. Here, using a novel automated operant conditioning system, we recorded many thousands of acoustic judgments of similarity from zebra finches, and used this perceptual decision data for the first time to train a deep learning system. The trained system outperforms other computational methods for the task of making the same judgments as birds. This algorithm to compare song similarity, together with the potential of extending the general approach to other species, places the study of bird song on a firmer footing.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Automatic detection of migrating soaring bird flocks using weather radars by deep learning
    Schekler, Inbal
    Nave, Tamir
    Shimshoni, Ilan
    Sapir, Nir
    METHODS IN ECOLOGY AND EVOLUTION, 2023, 14 (08): : 2084 - 2094
  • [42] Automatic Bird-Species Recognition using the Deep Learning and Web Data Mining
    Kang, Min-Seok
    Hong, Kwang-Seok
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1258 - 1260
  • [43] Vehicle trajectory prediction on highways using bird eye view representations and deep learning
    Rubén Izquierdo
    Álvaro Quintanar
    David Fernández Llorca
    Iván García Daza
    Noelia Hernández
    Ignacio Parra
    Miguel Ángel Sotelo
    Applied Intelligence, 2023, 53 : 8370 - 8388
  • [44] Traffic Sign Recognition Using Kernel Extreme Learning Machines With Deep Perceptual Features
    Zeng, Yujun
    Xu, Xin
    Shen, Dayong
    Fang, Yuqiang
    Xiao, Zhipeng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (06) : 1647 - 1653
  • [45] Removing Human Bottlenecks in Bird Classification Using Camera Trap Images and Deep Learning
    Chalmers, Carl
    Fergus, Paul
    Wich, Serge
    Longmore, Steven. N. N.
    Walsh, Naomi Davies
    Stephens, Philip. A. A.
    Sutherland, Chris
    Matthews, Naomi
    Mudde, Jens
    Nuseibeh, Amira
    REMOTE SENSING, 2023, 15 (10)
  • [46] Extracting Decision Tree From Trained Deep Reinforcement Learning in Traffic Signal Control
    Zhu, Yuanyang
    Yin, Xiao
    Chen, Chunlin
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (04) : 1997 - 2007
  • [47] Vehicle trajectory prediction on highways using bird eye view representations and deep learning
    Izquierdo, Ruben
    Quintanar, Alvaro
    Fernandez Llorca, David
    Garcia Daza, Ivan
    Hernandez, Noelia
    Parra, Ignacio
    Angel Sotelo, Miguel
    APPLIED INTELLIGENCE, 2023, 53 (07) : 8370 - 8388
  • [48] Aerial-trained deep learning networks for surveying cetaceans from satellite imagery
    Borowicz, Alex
    Le, Hieu
    Humphries, Grant
    Nehls, Georg
    Hoeschle, Caroline
    Kosarev, Vladislav
    Lynch, Heather J.
    PLOS ONE, 2019, 14 (10):
  • [49] Extracting Decision Tree from Trained Deep Reinforcement Learning in Traffic Signal Control
    Zhu, Yuanyang
    Yin, Xiao
    Li, Ruyu
    Chen, Chunlin
    2021 INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SOCIAL INTELLIGENCE (ICCSI), 2021,
  • [50] Automated summarisation of SDOCT volumes using deep learning: Transfer learning vs de novo trained networks
    Antony, Bhavna Josephine
    Maetschke, Stefan
    Garnavi, Rahil
    PLOS ONE, 2019, 14 (05):