Deep Learning Based Open Set Acoustic Scene Classification

Cited by: 4
Authors
Kwiatkowska, Zuzanna [1]
Kalinowski, Benjamin [1]
Kosmider, Michal [1]
Rykaczewski, Krzysztof [1]
Affiliations
[1] Samsung R&D Inst, Warsaw, Poland
Source
INTERSPEECH 2020 | 2020
Keywords
acoustic scenes; open set classification; deep learning; conditional autoencoders;
DOI
10.21437/Interspeech.2020-3092
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
In this work, we compare the performance of three selected techniques in open set acoustic scene classification (ASC). We test thresholding of the softmax output of a deep network classifier, which is currently the most popular technique employed in ASC. Further, we compare the results with the Openmax classifier, which originates from the computer vision field. As the third model, we use the Adapted Class-Conditioned Autoencoder (Adapted C2AE), which is our variation of another computer vision technique called C2AE. Adapted C2AE allows a fairer comparison across the given experiments and simplifies the original inference procedure, making it more applicable in real-life scenarios. We also analyse two training scenarios: one without additional knowledge of unknown classes and another where a limited subset of examples from the unknown classes is available. We find that the C2AE-based method outperforms both thresholding and Openmax, obtaining 85.5% Area Under the Receiver Operating Characteristic curve (AUROC) and 66% open set accuracy on the data used in the Detection and Classification of Acoustic Scenes and Events Challenge 2019 Task 1C.
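As a rough illustration of the first technique named in the abstract, the sketch below thresholds the maximum softmax probability to reject unknown scenes, and scores known-versus-unknown separation with a pairwise AUROC estimate. It is not the authors' implementation: the function names, the `UNKNOWN` label, the threshold of 0.6, and the toy logits are all assumptions made for this example.

```python
import numpy as np

UNKNOWN = -1  # hypothetical label used here for rejected (unknown) samples


def softmax(logits):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def predict_open_set(logits, threshold=0.5):
    """Return the arg-max class, or UNKNOWN when the top probability is below threshold."""
    probs = softmax(np.asarray(logits, dtype=float))
    best = probs.argmax(axis=1)
    confident = probs.max(axis=1) >= threshold
    return np.where(confident, best, UNKNOWN)


def auroc(scores_known, scores_unknown):
    """Fraction of (known, unknown) pairs where the known sample scores higher.

    Ties count as half. This pairwise estimate equals the area under the ROC
    curve when the score is used to separate known from unknown samples.
    """
    k = np.asarray(scores_known, dtype=float)[:, None]
    u = np.asarray(scores_unknown, dtype=float)[None, :]
    return float(((k > u) + 0.5 * (k == u)).mean())


# Toy logits for three clips over three known scene classes (made-up values).
logits = np.array([[4.0, 0.0, 0.0],
                   [0.1, 0.0, 0.2],
                   [0.0, 3.0, 0.5]])
preds = predict_open_set(logits, threshold=0.6)
print(preds)  # the second clip has no dominant class and is rejected
```

In this sketch the confidence score is simply the top softmax probability; the same `auroc` helper could be applied to any other known-versus-unknown score, such as a reconstruction error with its sign flipped.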
Pages: 1216-1220
Number of pages: 5