The main objective of unsupervised anomalous sound detection (ASD) is to identify anomalous sounds when only normal sound samples are available for training. Existing ASD methods rely primarily on generative and discriminative models. Among generative approaches, the autoencoder (AE) is widely used for anomaly detection; however, owing to the "shortcut" problem, it often misclassifies anomalous samples as normal. Discriminative methods, in contrast, perform well but often suffer from poor stability. This work introduces an architecture named the self-supervised classification deep hierarchical reconstruction network (SCDHR), which combines generative and discriminative model structures. The network processes the input with convolutional kernels of different sizes in parallel branches to extract more discriminative features. In addition, a symmetric fusion attention (SFA) module is proposed; by integrating temporal, frequency, and coordinate attention across branches, it strengthens the model's ability to select relevant features. Furthermore, a one-class center loss is combined with the standard center loss to obtain more compact feature representations, improving the model's ability to distinguish anomalous samples. Finally, the proposed method is validated on the DCASE 2023 Task 2 dataset, achieving a harmonic mean of the AUC and pAUC scores of 65.17% on the Development Dataset and 68.16% on the Evaluation Dataset, outperforming state-of-the-art methods.
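
Since the abstract only names the loss design, the following minimal PyTorch sketch illustrates one way a standard per-class center loss can be combined with a one-class center loss that pulls all normal embeddings toward a single shared center. The class name `CombinedCenterLoss`, the learnable centers, and the weighting factor `lam` are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn

class CombinedCenterLoss(nn.Module):
    """Illustrative combination of a standard (per-class) center loss with a
    one-class center loss; the weighting `lam` and the learnable centers are
    assumptions, not the paper's exact formulation."""
    def __init__(self, num_classes: int, feat_dim: int, lam: float = 0.5):
        super().__init__()
        # One learnable center per auxiliary (self-supervised) class.
        self.class_centers = nn.Parameter(torch.randn(num_classes, feat_dim))
        # A single learnable center shared by all normal samples.
        self.global_center = nn.Parameter(torch.randn(feat_dim))
        self.lam = lam

    def forward(self, feats: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        # Standard center loss: squared distance to each sample's class center.
        center_loss = ((feats - self.class_centers[labels]) ** 2).sum(dim=1).mean()
        # One-class center loss: squared distance to the shared normal center.
        one_class_loss = ((feats - self.global_center) ** 2).sum(dim=1).mean()
        return center_loss + self.lam * one_class_loss

# Example usage with random embeddings and self-supervised class labels.
loss_fn = CombinedCenterLoss(num_classes=10, feat_dim=128)
feats = torch.randn(32, 128)
labels = torch.randint(0, 10, (32,))
loss = loss_fn(feats, labels)
```

Intuitively, the per-class term keeps embeddings of each auxiliary class compact, while the one-class term tightens the overall normal-data cluster, so anomalous samples fall farther from the learned centers at test time.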