A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network

被引：0

作者：

Zhang, Tao ^{[1
]}

Kong, Lingguo ^{[1
]}

Zhao, Xin ^{[1
]}

Li, Donglei ^{[1
]}

Geng, Yanzhang ^{[1
]}

Ding, Biyun ^{[2
]}

Wang, Chao ^{[1
]}

机构：

[1] Tianjin Univ, Sch Elect Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China

[2] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghe South Ave, Nanchang 330063, Jiangxi, Peoples R China

来源：

APPLIED INTELLIGENCE | 2025年 / 55卷 / 06期

关键词：

Anomalous sound detection; Audio signal processing; Self-supervised learning; Acoustic feature extraction; Domain shift;

D O I：

10.1007/s10489-025-06366-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Anomalous Sound Detection (ASD) has research significance and application prospect industrial automation. Most existing models of ASD have limited ability to effectively utilize machine sound features, leading to reduced stability against sound anomalies and domain shift variations. To address the above issues, we propose a self-supervised ASD model based on spectrogram decomposition and parallel sub-network in this paper. Firstly, we decompose the spectrogram along the time and frequency dimensions to balance feature size and information integrity. This approach emphasizes the temporal and frequency variations in the feature map, facilitating a better understanding of the factors that affect machine sounds under domain shift conditions. Secondly, we design a pair of parallel training sub-networks. The parallel sub-networks employ self-attention mechanisms and shared gradients to effectively capture changes in features across both time and frequency dimensions. This approach improves model stability against anomalies and domain shifts. Finally, the anomaly scores of sub-network branches are fused as anomalous detection results. The performance of the proposed model is validated on DCASE2022 Task2 dataset. The Area under the Receiver Operating Characteristic Curve (AUC) and partial AUC (pAUC) of our model reached 72.89% and 64.83%. The results confirm the effectiveness of the proposed model, achieving better performance.

引用

页数：18

共 46 条

[21] Anomal-E: A self-supervised network intrusion detection system based on graph neural networks
Caville, Evan
Lo, Wai Weng
Layeghy, Siamak
Portmann, Marius
KNOWLEDGE-BASED SYSTEMS, 2022, 258
[22] STMNet: Single-Temporal Mask-Based Network for Self-Supervised Hyperspectral Change Detection
Zhou, Tianyuan
Luo, Fulin
Fu, Chuan
Guo, Tan
Wang, Xiaopan
Du, Bo
Gao, Xinbo
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[23] Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Zhang, Hejing
Guan, Jian
Zhu, Qiaoxi
Xiao, Feiyang
Liu, Youde
INTERSPEECH 2023, 2023, : 336 - 340
[24] MeshKINN: A self-supervised mesh generation model based on Kolmogorov-Arnold-Informed neural network
Zhang, Haoxuan
Wang, Min
Li, Haisheng
Li, Nan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
[25] ASiT-CRNN: A method for sound event detection with fine-tuning of self-supervised pre-trained ASiT-based model
Zheng, Yueyang
Zhang, Ruikun
Atito, Sara
Yang, Shuguo
Wang, Wenwu
Mei, Yiduo
DIGITAL SIGNAL PROCESSING, 2025, 160
[26] Self-Supervised Learning-Based General Laboratory Progress Pretrained Model for Cardiovascular Event Detection
Chen, Li-Chin
Hung, Kuo-Hsuan
Tseng, Yi-Ju
Wang, Hsin-Yao
Lu, Tse-Min
Huang, Wei-Chieh
Tsao, Yu
IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2024, 12 : 43 - 55
[27] Self-supervised denoising model based on deep audio prior using single noisy marine mammal sound sample
Zhu, Jifeng
Cai, Wenyu
Zhang, Meiyan
Yang, Yong
APPLIED INTELLIGENCE, 2023, 53 (21) : 25697 - 25714
[28] Self-supervised denoising model based on deep audio prior using single noisy marine mammal sound sample
Jifeng Zhu
Wenyu Cai
Meiyan Zhang
Yong Yang
Applied Intelligence, 2023, 53 : 25697 - 25714
[29] SAR-Optical Image Matching Using Self-Supervised Detection and a Transformer-CNN-Based Network
Liu, Yijun
Lin, Mingxin
Mo, Yuanhui
Wang, Qingsong
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
[30] FIAD net: a Fast SAR ship detection network based on feature integration attention and self-supervised learning
Wang, Deyi
Zhang, Chengkun
Han, Min
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (04) : 1485 - 1513

← 1 2 3 4 5 →