Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning

被引:1
|
作者
Manriquez P, P. Rodrigo [1 ,2 ]
Kotz, Sonja A. [2 ,3 ]
Ravignani, Andrea [4 ,5 ,6 ]
de Boer, Bart [1 ]
机构
[1] Vrije Univ Brussel, Artificial Intelligence Lab, Brussels, Belgium
[2] Maastricht Univ, Fac Psychol & Neurosci, Dept Neuropsychol & Psychopharmacol, Maastricht, Netherlands
[3] Max Planck Inst Human Cognit & Brain Sci, Dept Neuropsychol, Leipzig, Germany
[4] Sapienza Univ Rome, Dept Human Neurosci, Rome, Italy
[5] Aarhus Univ, Ctr Mus Brain, Dept Clin Med, Aarhus, Denmark
[6] Royal Acad Mus Aarhus Aalborg, Aarhus, Denmark
基金
新加坡国家研究基金会;
关键词
Artificial intelligence; machine learning; species recognition; species discrimination; DATA AUGMENTATION; ACOUSTIC-SIGNAL; NEURAL-NETWORKS; IDENTIFICATION;
D O I
10.1080/09524622.2024.2354468
中图分类号
Q95 [动物学];
学科分类号
071002 ;
摘要
In the last few years, automatic extraction and classification of animal vocalisations has been facilitated by machine learning (ML) and deep learning (DL) methods. Different frameworks allowed researchers to automatically extract features and perform classification tasks, aiding in call identification and species recognition. However, the success of these applications relies on the amount of available data to train these algorithms. The lack of sufficient data can also lead to overfitting and affect generalisation (i.e. poor performance on out-of-sample data). Further, acquiring large data sets is costly and annotating them is time consuming. Thus, how small can a dataset be to still provide useful information by means of ML or DL? Here, we show how convolutional neural network architectures can handle small datasets in a bioacoustic classification task of affective mammalian vocalisations. We explain how these techniques can be used (e.g. pre-training and data augmentation), and emphasise how to implement them in concordance with features of bioacoustic signals. We further discuss whether these networks can generalise the affective quality of vocalisations across different taxa.
引用
收藏
页码:354 / 371
页数:18
相关论文
共 50 条
  • [1] Deep Learning-based Mammogram Classification using Small Dataset
    Adedigba, Adeyinka P.
    Adeshina, Steve A.
    Aibinu, Abiodun M.
    2019 15TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2019,
  • [2] Performance Evaluation of Deep Learning Models on Mammogram Classification Using Small Dataset
    Adedigba, Adeyinka P.
    Adeshina, Steve A.
    Aibinu, Abiodun M.
    BIOENGINEERING-BASEL, 2022, 9 (04):
  • [3] Using Deep Learning for Defect Classification on a Small Weld X-ray Image Dataset
    Ajmi, Chiraz
    Zapata, Juan
    Martinez-Alvarez, Jose Javier
    Domenech, Gines
    Ruiz, Ramon
    JOURNAL OF NONDESTRUCTIVE EVALUATION, 2020, 39 (03)
  • [4] Deep Learning-Based Plant-Image Classification Using a Small Training Dataset
    Batchuluun, Ganbayar
    Nam, Se Hyun
    Park, Kang Ryoung
    MATHEMATICS, 2022, 10 (17)
  • [5] Using Deep Learning for Defect Classification on a Small Weld X-ray Image Dataset
    Chiraz Ajmi
    Juan Zapata
    José Javier Martínez-Álvarez
    Ginés Doménech
    Ramón Ruiz
    Journal of Nondestructive Evaluation, 2020, 39
  • [6] FUSING SHALLOW AND DEEP LEARNING FOR BIOACOUSTIC BIRD SPECIES CLASSIFICATION
    Salamon, Justin
    Bellol, Juan Pablo
    Farnsworth, Andrew
    Kelling, Steve
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 141 - 145
  • [7] SAMPLE Dataset Objects Classification Using Deep Learning Algorithms
    Turcanik, Michal
    Perdoch, Jozef
    RADIOENGINEERING, 2023, 32 (01) : 63 - 73
  • [8] Breast Cancer Dataset, Classification and Detection Using Deep Learning
    Iqbal, Muhammad Shahid
    Ahmad, Waqas
    Alizadehsani, Roohallah
    Hussain, Sadiq
    Rehman, Rizwan
    HEALTHCARE, 2022, 10 (12)
  • [9] CentralBark Image Dataset and Tree Species Classification Using Deep Learning
    Warner, Charles
    Wu, Fanyou
    Gazo, Rado
    Benes, Bedrich
    Kong, Nicole
    Fei, Songlin
    ALGORITHMS, 2024, 17 (05)
  • [10] Deep metric learning for bioacoustic classification: Overcoming training data scarcity using dynamic triplet loss
    Thakur, Anshul
    Thapar, Daksh
    Rajan, Padmanabhan
    Nigam, Aditya
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (01): : 534 - 547