A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network

被引:0
|
作者
Zhang, Tao [1 ]
Kong, Lingguo [1 ]
Zhao, Xin [1 ]
Li, Donglei [1 ]
Geng, Yanzhang [1 ]
Ding, Biyun [2 ]
Wang, Chao [1 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China
[2] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghe South Ave, Nanchang 330063, Jiangxi, Peoples R China
关键词
Anomalous sound detection; Audio signal processing; Self-supervised learning; Acoustic feature extraction; Domain shift;
D O I
10.1007/s10489-025-06366-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomalous Sound Detection (ASD) has research significance and application prospect industrial automation. Most existing models of ASD have limited ability to effectively utilize machine sound features, leading to reduced stability against sound anomalies and domain shift variations. To address the above issues, we propose a self-supervised ASD model based on spectrogram decomposition and parallel sub-network in this paper. Firstly, we decompose the spectrogram along the time and frequency dimensions to balance feature size and information integrity. This approach emphasizes the temporal and frequency variations in the feature map, facilitating a better understanding of the factors that affect machine sounds under domain shift conditions. Secondly, we design a pair of parallel training sub-networks. The parallel sub-networks employ self-attention mechanisms and shared gradients to effectively capture changes in features across both time and frequency dimensions. This approach improves model stability against anomalies and domain shifts. Finally, the anomaly scores of sub-network branches are fused as anomalous detection results. The performance of the proposed model is validated on DCASE2022 Task2 dataset. The Area under the Receiver Operating Characteristic Curve (AUC) and partial AUC (pAUC) of our model reached 72.89% and 64.83%. The results confirm the effectiveness of the proposed model, achieving better performance.
引用
收藏
页数:18
相关论文
共 46 条
  • [41] Attention-Based Spatial and Spectral Network with PCA-Guided Self-Supervised Feature Extraction for Change Detection in Hyperspectral Images
    Wang, Zhao
    Jiang, Fenlong
    Liu, Tongfei
    Xie, Fei
    Li, Peng
    REMOTE SENSING, 2021, 13 (23)
  • [42] Formulating Parallel Supervised Machine Learning Designs For Anomaly-Based Network Intrusion Detection in Resource Constrained Use Cases
    Joshi, Varun
    Korah, John
    2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 748 - 753
  • [43] CMR Quantification of Myocardial Oxygenation Extraction Fraction Maps Using a Model-based Self-supervised Learning Network: Initial Experience in Patients with Cardiomyopathies
    Huang, Qi
    Tang, Haoteng
    Wang, Keyang
    Li, Ran
    Zhan, Marcus
    Yang, Yang
    Woodard, Pamela
    Zheng, Jie
    CIRCULATION, 2024, 150
  • [44] Performance analysis of weakly-supervised sound event detection system based on the mean-teacher convolutional recurrent neural network model
    Lee, Seokjin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (02): : 139 - 147
  • [45] Unsupervised Noise-Resistant Remote-Sensing Image Change Detection: A Self-Supervised Denoising Network-, FCM_SICM-, and EMD Metric-Based Approach
    Xie, Jiangling
    Li, Yikun
    Yang, Shuwen
    Li, Xiaojun
    REMOTE SENSING, 2024, 16 (17)
  • [46] A semi-supervised Anti-Fraud model based on integrated XGBoost and BiGRU with self-attention network: an application to internet loan fraud detection
    Gorle, Venkata Lakshmi Narayana
    Panigrahi, Suvasini
    CURRENT SCIENCE, 2024, 126 (02): : 56939 - 56964