Deep Learning Based Open Set Acoustic Scene Classification

被引:4
|
作者
Kwiatkowska, Zuzanna [1 ]
Kalinowski, Benjamin [1 ]
Kosmider, Michal [1 ]
Rykaczewski, Krzysztof [1 ]
机构
[1] Samsung R&D Inst, Warsaw, Poland
来源
INTERSPEECH 2020 | 2020年
关键词
acoustic scenes; open set classification; deep learning; conditional autoencoders;
D O I
10.21437/Interspeech.2020-3092
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this work, we compare the performance of three selected techniques in open set acoustic scenes classification (ASC). We test thresholding of the softmax output of a deep network classifier, which is the most popular technique nowadays employed in ASC. Further we compare the results with the Openmax classifier which is derived from the computer vision field. As the third model, we use the Adapted Class-Conditioned Autoencoder (Adapted C2AE) which is our variation of another computer vision related technique called C2AE. Adapted C2AE encompasses a more fair comparison of the given experiments and simplifies the original inference procedure, making it more applicable in the real-life scenarios. We also analyse two training scenarios: without additional knowledge of unknown classes and another where a limited subset of examples from the unknown classes is available. We find that the C2AE based method outperforms the thresholding and Openmax, obtaining 85.5% Area Under the Receiver Operating Characteristic curve (AUROC) and 66% of open set accuracy on data used in Detection and Classification of Acoustic Scenes and Events Challenge 2019 Task 1C.
引用
收藏
页码:1216 / 1220
页数:5
相关论文
共 50 条
  • [41] Blood Vessel Delineation in Endoscopic Images with Deep Learning Based Scene Classification
    Golhar, Mayank
    Iwahori, Yuji
    Bhuyan, M. K.
    Funahashi, Kenji
    Kasugai, Kunio
    PATTERN RECOGNITION APPLICATIONS AND METHODS, 2018, 10857 : 147 - 168
  • [42] A Multichannel-Based Deep Learning Framework for Ocean SAR Scene Classification
    Bai, Chengzu
    Zhang, Shuo
    Wang, Xinning
    Wen, Jiaqiang
    Li, Chong
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [43] Remote Sensing Image Scene Classification Based on SURF Feature and Deep Learning
    Liang, Jinxiang
    Dang, Jianwu
    Wang, Yangping
    Yang, Jingyu
    Zhang, Zhenhai
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1128 - 1133
  • [44] Exemplar based Deep Discriminative and Shareable Feature Learning for scene image classification
    Zuo, Zhen
    Wang, Gang
    Shuai, Bing
    Zhao, Lifan
    Yang, Qingxiong
    PATTERN RECOGNITION, 2015, 48 (10) : 3004 - 3015
  • [45] ACOUSTIC SCENE CLASSIFICATION USING SPARSE FEATURE LEARNING AND EVENT-BASED POOLING
    Lee, Kyogu
    Hyung, Ziwon
    Nam, Juhan
    2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [46] Open Set Recognition of Communication Signal Modulation Based on Deep Learning
    Zhang, Xinliang
    Li, Tianyun
    Gong, Pei
    Liu, Renwei
    Zha, Xiong
    Tang, Wenqi
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (07) : 1588 - 1592
  • [47] Traffic Scene Analysis and Classification using Deep Learning
    Dorrani, Z.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2024, 37 (03): : 496 - 502
  • [48] ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING
    Bisot, Victor
    Serizel, Romain
    Essid, Slim
    Richard, Gael
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6445 - 6449
  • [49] Online Continual Learning in Acoustic Scene Classification: An Empirical Study
    Ha, Donghee
    Kim, Mooseop
    Jeong, Chi Yoon
    SENSORS, 2023, 23 (15)
  • [50] Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification
    Bisot, Victor
    Serizel, Romain
    Essid, Slim
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1216 - 1229