A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network

被引:0
|
作者
Zhang, Tao [1 ]
Kong, Lingguo [1 ]
Zhao, Xin [1 ]
Li, Donglei [1 ]
Geng, Yanzhang [1 ]
Ding, Biyun [2 ]
Wang, Chao [1 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China
[2] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghe South Ave, Nanchang 330063, Jiangxi, Peoples R China
关键词
Anomalous sound detection; Audio signal processing; Self-supervised learning; Acoustic feature extraction; Domain shift;
D O I
10.1007/s10489-025-06366-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomalous Sound Detection (ASD) has research significance and application prospect industrial automation. Most existing models of ASD have limited ability to effectively utilize machine sound features, leading to reduced stability against sound anomalies and domain shift variations. To address the above issues, we propose a self-supervised ASD model based on spectrogram decomposition and parallel sub-network in this paper. Firstly, we decompose the spectrogram along the time and frequency dimensions to balance feature size and information integrity. This approach emphasizes the temporal and frequency variations in the feature map, facilitating a better understanding of the factors that affect machine sounds under domain shift conditions. Secondly, we design a pair of parallel training sub-networks. The parallel sub-networks employ self-attention mechanisms and shared gradients to effectively capture changes in features across both time and frequency dimensions. This approach improves model stability against anomalies and domain shifts. Finally, the anomaly scores of sub-network branches are fused as anomalous detection results. The performance of the proposed model is validated on DCASE2022 Task2 dataset. The Area under the Receiver Operating Characteristic Curve (AUC) and partial AUC (pAUC) of our model reached 72.89% and 64.83%. The results confirm the effectiveness of the proposed model, achieving better performance.
引用
收藏
页数:18
相关论文
共 46 条
  • [1] Machine Anomalous Sound Detection Based on Self-Supervised Classification
    Wang, Shuxian
    Du, Jun
    Wang, Yajian
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 449 - 454
  • [2] SELF-SUPERVISED LEARNING FOR ANOMALOUS SOUND DETECTION
    Wilkinghoff, Kevin
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 276 - 280
  • [3] Self-supervised Complex Network for Machine Sound Anomaly Detection
    Kim, Miseul
    Minh Tri Ho
    Kang, Hong-Goo
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 586 - 590
  • [4] ASDNet: An Efficient Self-Supervised Convolutional Network for Anomalous Sound Detection
    Kong, Dewei
    Yuan, Guoshun
    Yu, Hongjiang
    Wang, Shuai
    Zhang, Bo
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [5] FLOW-BASED SELF-SUPERVISED DENSITY ESTIMATION FOR ANOMALOUS SOUND DETECTION
    Dohi, Kota
    Endo, Takashi
    Purohit, Harsh
    Tanabe, Ryo
    Kawaguchi, Yohei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 336 - 340
  • [6] SSDPT: Self-supervised dual-path transformer for anomalous sound detection
    Bai, Jisheng
    Chen, Jianfeng
    Wang, Mou
    Ayub, Muhammad Saad
    Yan, Qingli
    DIGITAL SIGNAL PROCESSING, 2023, 135
  • [7] Anomalous Sub-Trajectory Detection With Graph Contrastive Self-Supervised Learning
    Kong, Xiangjie
    Lin, Hang
    Jiang, Renhe
    Shen, Guojiang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (07) : 9800 - 9811
  • [8] Anomalous Sound Detection Using Self-Supervised Classification Deep Hierarchical Reconstruction Network with Symmetric Fusion Attention
    Wang, Hui
    Shen, Kuan
    Wang, Fuquan
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
  • [9] SELF-SUPERVISED REPRESENTATION LEARNING FOR UNSUPERVISED ANOMALOUS SOUND DETECTION UNDER DOMAIN SHIFT
    Chen, Han
    Song, Yan
    Dai, Li-Rong
    McLoughlin, Ian
    Liu, Lin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 471 - 475
  • [10] Network Intrusion Detection Model Based on Improved BYOL Self-Supervised Learning
    Wang, Zhendong
    Li, Zeyu
    Wang, Junling
    Li, Dahai
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021