Acoustic auto-encoders for biodiversity assessment

被引:12
|
作者
Rowe, Benjamin [1 ]
Eichinski, Philip [1 ]
Zhang, Jinglan [1 ]
Roe, Paul [1 ]
机构
[1] Queensland Univ Technol, Brisbane, Qld, Australia
关键词
Eco-acoustics; Deep learning; Auto-encoders;
D O I
10.1016/j.ecoinf.2021.101237
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Continuous audio recordings are playing an ever more important role in conservation and biodiversity monitoring, however, listening to these recordings is often infeasible, as they can be thousands of hours long. Automating analysis using machine learning algorithms requires a feature representation. In this paper we propose a technique for learning a general feature representation from unlabelled audio using auto-encoders, which can be used for analysing environmental audio on a small timescale. We start by segmenting the audio data into non-overlapping 1-s long chunks and generating audio spectrograms. These audio spectrograms are then used to train a basic auto-encoder, with the output of the encoder network being used to generate the feature representation. We have found that at a 1-s timescale, our feature representation offers marginal improvements over ?acoustic indices?, a common representation for analysing environmental audio.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Fisher Auto-Encoders
    Elkhalil, Khalil
    Hasan, Ali
    Ding, Jie
    Farsiu, Sina
    Tarokh, Vahid
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 352 - 360
  • [2] Ornstein Auto-Encoders
    Choi, Youngwon
    Won, Joong-Ho
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2172 - 2178
  • [3] Transforming Auto-Encoders
    Hinton, Geoffrey E.
    Krizhevsky, Alex
    Wang, Sida D.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT I, 2011, 6791 : 44 - 51
  • [4] Correlated Variational Auto-Encoders
    Tang, Da
    Liang, Dawen
    Jebara, Tony
    Ruozzi, Nicholas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] Hyperspherical Variational Auto-Encoders
    Davidson, Tim R.
    Falorsi, Luca
    De Cao, Nicola
    Kipf, Thomas
    Tomczak, Jakub M.
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 856 - 865
  • [6] Directed Graph Auto-Encoders
    Kollias, Georgios
    Kalantzis, Vasileios
    Ide, Tsuyoshi
    Lozano, Aurelie
    Abe, Naoki
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7211 - 7219
  • [7] Graph Attention Auto-Encoders
    Salehi, Amin
    Davulcu, Hasan
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 989 - 996
  • [8] Conservativeness of Untied Auto-Encoders
    Im, Daniel Jiwoong
    Belghazi, Mohamed Ishmael
    Memisevic, Roland
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1694 - 1700
  • [9] Towards Asymmetric Auto-Encoders for the IoBNT
    Angerbauer, Stefan
    Khanzadeh, Roya
    Enzenhofer, Franz
    Springer, Andreas
    Haselmayr, Werner
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON NANOSCALE COMPUTING AND COMMUNICATION, NANOCOM 2023, 2023, : 166 - 167
  • [10] TURBO: The Swiss Knife of Auto-Encoders
    Quetant, Guillaume
    Belousov, Yury
    Kinakh, Vitaliy
    Voloshynovskiy, Slava
    ENTROPY, 2023, 25 (10)