Anomalous Sound Detection Using Self-Supervised Classification Deep Hierarchical Reconstruction Network with Symmetric Fusion Attention

被引:0
|
作者
Wang, Hui [1 ]
Shen, Kuan [1 ]
Wang, Fuquan [1 ]
机构
[1] Chongqing Univ, Coll Optoelect Engn, Chongqing 400044, Peoples R China
关键词
Anomalous sound detection; Generative model; Discriminative model; Attention mechanism; Center loss;
D O I
10.1007/s00034-025-03064-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The main objective of unsupervised anomalous sound detection (ASD) is to identify anomalous sound events among normal sound samples. Existing ASD methods primarily rely on generative and discriminative models. Among them, the autoencoder (AE) based on generative models is widely used for anomaly detection. However, due to the 'shortcut' problem, it often misclassifies abnormal samples as normal. In contrast, discriminative model-based methods, while exhibiting good performance, often suffer from poor stability. This research introduces an architecture named the self-supervised classification deep hierarchical reconstruction network (SCDHR), which combines generative and discriminative model structures. The system uses convolutional kernels of varying sizes across different branches to process input data, aiming to extract more discriminative features. Additionally, a module called symmetric fusion attention (SFA) is introduced. This module enhances the model's ability to process input by integrating attention mechanisms for time, frequency, and coordinate across different branches. As a result, the model's ability to select relevant features is improved. Furthermore, the one class center loss is incorporated and combined with the standard center loss to obtain more compact feature representations, thereby enhancing the model's ability to distinguish anomalous samples. Finally, the proposed method is validated on the DCASE 2023 TASK 2 dataset, achieving a harmonic mean of 65.17% for AUC and PAUC on the Development Dataset and 68.16% on the Evaluation Dataset, outperforming state-of-the-art methods.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Self-Supervised Convolutional Neural Network via Spectral Attention Module for Hyperspectral Image Classification
    Huang, Hong
    Luo, Liuyang
    Pu, Chunyu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [42] Attention fusion network with self-supervised learning for staging of osteonecrosis of the femoral head (ONFH) using multiple MR protocols
    Kim, Bomin
    Lee, Geun Young
    Park, Sung-Hong
    MEDICAL PHYSICS, 2023, 50 (09) : 5528 - 5540
  • [43] Self-supervised fusion network for RGB-D interest point detection and description
    Li, Ningning
    Wang, Xiaomin
    Zheng, Zhou
    Sun, Zhendong
    PATTERN RECOGNITION, 2025, 158
  • [44] DAN-SuperPoint: Self-Supervised Feature Point Detection Algorithm with Dual Attention Network
    Li, Zhaoyang
    Cao, Jie
    Hao, Qun
    Zhao, Xue
    Ning, Yaqian
    Li, Dongxing
    SENSORS, 2022, 22 (05)
  • [45] Suppressing label noise in medical image classification using mixup attention and self-supervised learning
    Gao, Mengdi
    Jiang, Hongyang
    Hu, Yan
    Ren, Qiushi
    Xie, Zhaoheng
    Liu, Jiang
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (10):
  • [46] Shadow detection using a cross-attentional dual-decoder network with self-supervised image reconstruction features
    Fernandez-Beltran, Ruben
    Guzman-Ponce, Angelica
    Fernandez, Rafael
    Kang, Jian
    Garcia-Mateos, Gines
    IMAGE AND VISION COMPUTING, 2024, 143
  • [47] Self-supervised Body Image Acquisition Using a Deep Neural Network for Sensorimotor Prediction
    Laflaquiere, Alban
    Hafner, Verena V.
    2019 JOINT IEEE 9TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2019, : 117 - 122
  • [48] A deep neural network for the classification of epileptic seizures using hierarchical attention mechanism
    Chirasani, Sateesh Kumar Reddy
    Manikandan, Suchetha
    SOFT COMPUTING, 2022, 26 (11) : 5389 - 5397
  • [49] Speaker recognition using isomorphic graph attention network based pooling on self-supervised representation *
    Ge, Zirui
    Xu, Xinzhou
    Guo, Haiyan
    Wang, Tingting
    Yang, Zhen
    APPLIED ACOUSTICS, 2024, 219
  • [50] Noise-based self-supervised anomaly detection in washing machines using a deep neural network with operational information
    Shul, Yusun
    Yi, Wonjun
    Choi, Jihoon
    Kang, Dong-Soo
    Choi, Jung-Woo
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 189