Deep Learning Based Open Set Acoustic Scene Classification

Cited by: 4

Authors:

Kwiatkowska, Zuzanna [1]

Kalinowski, Benjamin [1]

Kosmider, Michal [1]

Rykaczewski, Krzysztof [1]

Affiliations:

[1] Samsung R&D Inst, Warsaw, Poland

Source:

INTERSPEECH 2020 | 2020

Keywords:

acoustic scenes; open set classification; deep learning; conditional autoencoders

DOI:

10.21437/Interspeech.2020-3092

Chinese Library Classification number:

R36 [Pathology]; R76 [Otorhinolaryngology]

Discipline classification code:

100104; 100213

Abstract:

In this work, we compare the performance of three selected techniques in open set acoustic scene classification (ASC). We test thresholding of the softmax output of a deep network classifier, which is currently the most popular technique in ASC. We then compare the results with the Openmax classifier, which originates from the computer vision field. As the third model, we use the Adapted Class-Conditioned Autoencoder (Adapted C2AE), our variation of another computer vision technique called C2AE. Adapted C2AE allows a fairer comparison across the given experiments and simplifies the original inference procedure, making it more applicable in real-life scenarios. We also analyse two training scenarios: one without additional knowledge of unknown classes, and another where a limited subset of examples from the unknown classes is available. We find that the C2AE-based method outperforms thresholding and Openmax, obtaining 85.5% Area Under the Receiver Operating Characteristic curve (AUROC) and 66% open set accuracy on the data used in the Detection and Classification of Acoustic Scenes and Events Challenge 2019 Task 1C.
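The softmax thresholding baseline named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `UNKNOWN` label value, the threshold, and the example logits are all assumptions chosen for illustration. The idea is that a sample is assigned to the most probable known class only if that probability reaches a threshold; otherwise it is rejected as unknown.

```python
import numpy as np

UNKNOWN = -1  # hypothetical label used here for "unknown scene"


def softmax(logits):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def open_set_predict(logits, threshold=0.5):
    """Return the arg-max class per row, or UNKNOWN when the top
    softmax probability is below `threshold` (an assumed value)."""
    probs = softmax(np.asarray(logits, dtype=float))
    confidence = probs.max(axis=1)
    labels = probs.argmax(axis=1)
    return np.where(confidence >= threshold, labels, UNKNOWN)


# Illustrative logits for two clips over three known scene classes.
example_logits = np.array([
    [4.0, 0.5, 0.1],    # confident: class 0 dominates
    [0.3, 0.2, 0.25],   # near-uniform: no class reaches the threshold
])
predictions = open_set_predict(example_logits, threshold=0.6)
print(predictions.tolist())
```

In practice the threshold would be tuned on held-out data; sweeping it over its range traces out the ROC curve that an AUROC figure summarises.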


Pages: 1216-1220

Page count: 5
