A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network

被引:0
|
作者
Zhang, Tao [1 ]
Kong, Lingguo [1 ]
Zhao, Xin [1 ]
Li, Donglei [1 ]
Geng, Yanzhang [1 ]
Ding, Biyun [2 ]
Wang, Chao [1 ]
机构
[1] Tianjin Univ, Sch Elect Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China
[2] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghe South Ave, Nanchang 330063, Jiangxi, Peoples R China
关键词
Anomalous sound detection; Audio signal processing; Self-supervised learning; Acoustic feature extraction; Domain shift;
D O I
10.1007/s10489-025-06366-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anomalous Sound Detection (ASD) has research significance and application prospect industrial automation. Most existing models of ASD have limited ability to effectively utilize machine sound features, leading to reduced stability against sound anomalies and domain shift variations. To address the above issues, we propose a self-supervised ASD model based on spectrogram decomposition and parallel sub-network in this paper. Firstly, we decompose the spectrogram along the time and frequency dimensions to balance feature size and information integrity. This approach emphasizes the temporal and frequency variations in the feature map, facilitating a better understanding of the factors that affect machine sounds under domain shift conditions. Secondly, we design a pair of parallel training sub-networks. The parallel sub-networks employ self-attention mechanisms and shared gradients to effectively capture changes in features across both time and frequency dimensions. This approach improves model stability against anomalies and domain shifts. Finally, the anomaly scores of sub-network branches are fused as anomalous detection results. The performance of the proposed model is validated on DCASE2022 Task2 dataset. The Area under the Receiver Operating Characteristic Curve (AUC) and partial AUC (pAUC) of our model reached 72.89% and 64.83%. The results confirm the effectiveness of the proposed model, achieving better performance.
引用
收藏
页数:18
相关论文
共 46 条
  • [21] Anomal-E: A self-supervised network intrusion detection system based on graph neural networks
    Caville, Evan
    Lo, Wai Weng
    Layeghy, Siamak
    Portmann, Marius
    KNOWLEDGE-BASED SYSTEMS, 2022, 258
  • [22] STMNet: Single-Temporal Mask-Based Network for Self-Supervised Hyperspectral Change Detection
    Zhou, Tianyuan
    Luo, Fulin
    Fu, Chuan
    Guo, Tan
    Wang, Xiaopan
    Du, Bo
    Gao, Xinbo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [23] Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
    Zhang, Hejing
    Guan, Jian
    Zhu, Qiaoxi
    Xiao, Feiyang
    Liu, Youde
    INTERSPEECH 2023, 2023, : 336 - 340
  • [24] MeshKINN: A self-supervised mesh generation model based on Kolmogorov-Arnold-Informed neural network
    Zhang, Haoxuan
    Wang, Min
    Li, Haisheng
    Li, Nan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [25] ASiT-CRNN: A method for sound event detection with fine-tuning of self-supervised pre-trained ASiT-based model
    Zheng, Yueyang
    Zhang, Ruikun
    Atito, Sara
    Yang, Shuguo
    Wang, Wenwu
    Mei, Yiduo
    DIGITAL SIGNAL PROCESSING, 2025, 160
  • [26] Self-Supervised Learning-Based General Laboratory Progress Pretrained Model for Cardiovascular Event Detection
    Chen, Li-Chin
    Hung, Kuo-Hsuan
    Tseng, Yi-Ju
    Wang, Hsin-Yao
    Lu, Tse-Min
    Huang, Wei-Chieh
    Tsao, Yu
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2024, 12 : 43 - 55
  • [27] Self-supervised denoising model based on deep audio prior using single noisy marine mammal sound sample
    Zhu, Jifeng
    Cai, Wenyu
    Zhang, Meiyan
    Yang, Yong
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25697 - 25714
  • [28] Self-supervised denoising model based on deep audio prior using single noisy marine mammal sound sample
    Jifeng Zhu
    Wenyu Cai
    Meiyan Zhang
    Yong Yang
    Applied Intelligence, 2023, 53 : 25697 - 25714
  • [29] SAR-Optical Image Matching Using Self-Supervised Detection and a Transformer-CNN-Based Network
    Liu, Yijun
    Lin, Mingxin
    Mo, Yuanhui
    Wang, Qingsong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [30] FIAD net: a Fast SAR ship detection network based on feature integration attention and self-supervised learning
    Wang, Deyi
    Zhang, Chengkun
    Han, Min
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (04) : 1485 - 1513