Bird song comparison using deep learning trained from avian perceptual judgments

被引:0
|
作者
Zandberg, Lies [1 ,2 ]
Morfi, Veronica [3 ]
George, Julia M. [2 ,4 ]
Clayton, David F. [2 ,5 ]
Stowell, Dan [3 ,6 ,7 ]
Lachlan, Robert F. [1 ,2 ]
机构
[1] Royal Holloway Univ London, Dept Psychol, London, England
[2] Queen Mary Univ London, Dept Psychol, London, England
[3] Queen Mary Univ London, Ctr Digital Mus C4DM, Machine Listening Lab, London, England
[4] Clemson Univ, Dept Biol Sci, Clemson, SC USA
[5] Clemson Univ, Dept Genet & Biochem, Clemson, SC USA
[6] Tilburg Univ, Dept Cognit Sci & AI, Tilburg, Netherlands
[7] Nat Biodivers Ctr, Leiden, Netherlands
基金
英国生物技术与生命科学研究理事会;
关键词
SWAMP SPARROW; DISCRIMINATION; CATEGORIZATION; MECHANISMS;
D O I
10.1371/journal.pcbi.1012329
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Our understanding of bird song, a model system for animal communication and the neurobiology of learning, depends critically on making reliable, validated comparisons between the complex multidimensional syllables that are used in songs. However, most assessments of song similarity are based on human inspection of spectrograms, or computational methods developed from human intuitions. Using a novel automated operant conditioning system, we collected a large corpus of zebra finches' (Taeniopygia guttata) decisions about song syllable similarity. We use this dataset to compare and externally validate similarity algorithms in widely-used publicly available software (Raven, Sound Analysis Pro, Luscinia). Although these methods all perform better than chance, they do not closely emulate the avian assessments. We then introduce a novel deep learning method that can produce perceptual similarity judgements trained on such avian decisions. We find that this new method outperforms the established methods in accuracy and more closely approaches the avian assessments. Inconsistent (hence ambiguous) decisions are a common occurrence in animal behavioural data; we show that a modification of the deep learning training that accommodates these leads to the strongest performance. We argue this approach is the best way to validate methods to compare song similarity, that our dataset can be used to validate novel methods, and that the general approach can easily be extended to other species. How do birds hear the differences between their songs? This fascinating question carries implications, since the study of bird song, a model system for the neurobiology of learning and animal communication, depends critically on our ability to assess the similarity of songs. Traditionally, researchers compare sounds by human assessment, or use computational methods based on human intuitions about similarity. However, neither approach is connected to birds' own perception of sound similarity. Here, using a novel automated operant conditioning system, we recorded many thousands of acoustic judgments of similarity from zebra finches, and used this perceptual decision data for the first time to train a deep learning system. The trained system outperforms other computational methods for the task of making the same judgments as birds. This algorithm to compare song similarity, together with the potential of extending the general approach to other species, places the study of bird song on a firmer footing.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Generalization Ability of Deep Learning Algorithms Trained Using SEM Data for Objects Classification
    Zaky, Yasmina
    Fortino, Nicolas
    Miramond, Benoit
    Dauvignac, Jean-Yves
    RADIO SCIENCE, 2022, 57 (12)
  • [32] Development of a deep learning network using a pre-trained convolutional neural network
    Rooney, M.
    Mitchell, J.
    McLaren, D. B.
    Nailon, W. H.
    RADIOTHERAPY AND ONCOLOGY, 2019, 133 : S1051 - S1052
  • [33] Tomato crop disease classification using pre-trained deep learning algorithm
    Rangarajan, Aravind Krishnaswamy
    Purushothaman, Raja
    Ramesh, Aniirudh
    INTERNATIONAL CONFERENCE ON ROBOTICS AND SMART MANUFACTURING (ROSMA2018), 2018, 133 : 1040 - 1047
  • [34] Early Prediction of Lung Cancers Using Deep Saliency Capsule and Pre-Trained Deep Learning Frameworks
    Ramana, Kadiyala
    Kumar, Madapuri Rudra
    Sreenivasulu, K.
    Gadekallu, Thippa Reddy
    Bhatia, Surbhi
    Agarwal, Parul
    Idrees, Sheikh Mohammad
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [35] Topology Optimization using Deep Learning ——Comparison of Simultaneous and Additional Learning——
    Sasaki H.
    Hidaka Y.
    Igarashi H.
    IEEJ Transactions on Power and Energy, 2020, 140 (12) : 858 - 865
  • [36] Estimating the complexity of bird song by using capture-recapture approaches from community ecology
    Garamszegi, LZ
    Balsby, TJS
    Bell, BD
    Borowiec, M
    Byers, BE
    Draganoiu, T
    Eens, M
    Forstmeier, W
    Galeotti, P
    Gil, D
    Gorissen, L
    Hansen, P
    Lampe, HM
    Leitner, S
    Lontkowski, J
    Nagle, L
    Nemeth, E
    Pinxten, R
    Rossi, JM
    Saino, N
    Tanvez, A
    Titus, R
    Török, J
    Van Duyse, E
    Müller, AP
    BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY, 2005, 57 (04) : 305 - 317
  • [37] Estimating the complexity of bird song by using capture-recapture approaches from community ecology
    László Z. Garamszegi
    Thorsten J. S. Balsby
    Ben D. Bell
    Marta Borowiec
    Bruce E. Byers
    Tudor Draganoiu
    Marcel Eens
    Wolfgang Forstmeier
    Paolo Galeotti
    Diego Gil
    Leen Gorissen
    Poul Hansen
    Helene M. Lampe
    Stefan Leitner
    Jan Lontkowski
    Laurent Nagle
    Erwin Nemeth
    Rianne Pinxten
    Jean-Marc Rossi
    Nicola Saino
    Aurélie Tanvez
    Russell Titus
    János Török
    Els Van Duyse
    Anders P. Møller
    Behavioral Ecology and Sociobiology, 2005, 57 : 305 - 317
  • [38] Automated classification of bird and amphibian calls using machine learning: A comparison of methods
    Acevedo, Miguel A.
    Corrada-Bravo, Carlos J.
    Corrada-Bravo, Hector
    Villanueva-Rivera, Luis J.
    Aide, T. Mitchell
    ECOLOGICAL INFORMATICS, 2009, 4 (04) : 206 - 214
  • [39] Effective near-duplicate image detection using perceptual hashing and deep learning
    Jakhar, Yash
    Borah, Malaya Dutta
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (04)
  • [40] Automatic Classification of Bird Sounds: Using MFCC and Mel Spectrogram Features with Deep Learning
    Carvalho, Silvestre
    Gomes, Elsa Ferreira
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2023, 10 (01) : 39 - 54