Bird song comparison using deep learning trained from avian perceptual judgments

被引:0
|
作者
Zandberg, Lies [1 ,2 ]
Morfi, Veronica [3 ]
George, Julia M. [2 ,4 ]
Clayton, David F. [2 ,5 ]
Stowell, Dan [3 ,6 ,7 ]
Lachlan, Robert F. [1 ,2 ]
机构
[1] Royal Holloway Univ London, Dept Psychol, London, England
[2] Queen Mary Univ London, Dept Psychol, London, England
[3] Queen Mary Univ London, Ctr Digital Mus C4DM, Machine Listening Lab, London, England
[4] Clemson Univ, Dept Biol Sci, Clemson, SC USA
[5] Clemson Univ, Dept Genet & Biochem, Clemson, SC USA
[6] Tilburg Univ, Dept Cognit Sci & AI, Tilburg, Netherlands
[7] Nat Biodivers Ctr, Leiden, Netherlands
基金
英国生物技术与生命科学研究理事会;
关键词
SWAMP SPARROW; DISCRIMINATION; CATEGORIZATION; MECHANISMS;
D O I
10.1371/journal.pcbi.1012329
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Our understanding of bird song, a model system for animal communication and the neurobiology of learning, depends critically on making reliable, validated comparisons between the complex multidimensional syllables that are used in songs. However, most assessments of song similarity are based on human inspection of spectrograms, or computational methods developed from human intuitions. Using a novel automated operant conditioning system, we collected a large corpus of zebra finches' (Taeniopygia guttata) decisions about song syllable similarity. We use this dataset to compare and externally validate similarity algorithms in widely-used publicly available software (Raven, Sound Analysis Pro, Luscinia). Although these methods all perform better than chance, they do not closely emulate the avian assessments. We then introduce a novel deep learning method that can produce perceptual similarity judgements trained on such avian decisions. We find that this new method outperforms the established methods in accuracy and more closely approaches the avian assessments. Inconsistent (hence ambiguous) decisions are a common occurrence in animal behavioural data; we show that a modification of the deep learning training that accommodates these leads to the strongest performance. We argue this approach is the best way to validate methods to compare song similarity, that our dataset can be used to validate novel methods, and that the general approach can easily be extended to other species. How do birds hear the differences between their songs? This fascinating question carries implications, since the study of bird song, a model system for the neurobiology of learning and animal communication, depends critically on our ability to assess the similarity of songs. Traditionally, researchers compare sounds by human assessment, or use computational methods based on human intuitions about similarity. However, neither approach is connected to birds' own perception of sound similarity. Here, using a novel automated operant conditioning system, we recorded many thousands of acoustic judgments of similarity from zebra finches, and used this perceptual decision data for the first time to train a deep learning system. The trained system outperforms other computational methods for the task of making the same judgments as birds. This algorithm to compare song similarity, together with the potential of extending the general approach to other species, places the study of bird song on a firmer footing.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] An interspecific comparison using immuno fluorescence reveals that synapse density in the avian song system is related to sex but not to male song repertoire size
    Nealen, PM
    BRAIN RESEARCH, 2005, 1032 (1-2) : 50 - 62
  • [22] Assessing the importance of social factors in bird song learning: A test using computer-simulated tutors
    Burt, John M.
    O'Loghlen, Adrian L.
    Templeton, Christopher N.
    Campbell, S. Elizabeth
    Beecher, Michael D.
    ETHOLOGY, 2007, 113 (10) : 917 - 925
  • [23] A deep learning model for the estimation of RF field trained from an analytical solution
    Montin, Eros
    Carluccio, Giuseppe
    Collins, Christopher
    Lattanzi, Riccardo
    2023 IEEE USNC-URSI RADIO SCIENCE MEETING, JOINT WITH AP-S SYMPOSIUM, 2023, : 71 - 72
  • [24] Bird's Eye: Analysis of Video Surveillance Data using Deep Learning
    More, Swapnil
    Thanawala, Aziz
    Shaikh, Tanishq
    Prabhu, Nanadana
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [25] Using deep learning to automate the detection of bird scaring lines on fishing vessels
    Acharya, Debaditya
    Saqib, Muhammad
    Devine, Carlie
    Untiedt, Candice
    Little, L. Richard
    Wang, Dadong
    Tuck, Geoffrey N.
    BIOLOGICAL CONSERVATION, 2024, 296
  • [26] Diabetic Retinopathy Detection: Improving Accuracy Using Multiple Transfer Learning Features from Pre-trained Deep Learning Networks
    Tiong, Kelvin Ka Yung
    Wong, W. K.
    Juwono, Filbert H.
    Chew, I. M.
    2022 INTERNATIONAL CONFERENCE ON GREEN ENERGY, COMPUTING AND SUSTAINABLE TECHNOLOGY (GECOST), 2022, : 171 - 175
  • [27] An Approach of Rhetorical Status Recognition for Judgments in Court Documents using Deep Learning Models
    Tran, Vu D.
    Nguyen, Minh L.
    Shirai, Kiyoaki
    Satoh, Ken
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 46 - 51
  • [28] Comparison of Pre-Trained CNNs for Audio Classification Using Transfer Learning
    Tsalera, Eleni
    Papadakis, Andreas
    Samarakou, Maria
    JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2021, 10 (04)
  • [29] Prediction of surface reflectance using a deep learning model trained on synthetic surface images
    Yoo, Jeonghyun
    Ki, Hyungson
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [30] Kurdish Sign Language Recognition Using Pre-Trained Deep Learning Models
    Alsaud, Ali A.
    Yousif, Raghad Z.
    Aziz, Marwan. M.
    Kareem, Shahab W.
    Maho, Amer J.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 1334 - 1344