A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network

被引：0

作者：

Zhang, Tao ^{[1
]}

Kong, Lingguo ^{[1
]}

Zhao, Xin ^{[1
]}

Li, Donglei ^{[1
]}

Geng, Yanzhang ^{[1
]}

Ding, Biyun ^{[2
]}

Wang, Chao ^{[1
]}

机构：

[1] Tianjin Univ, Sch Elect Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China

[2] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghe South Ave, Nanchang 330063, Jiangxi, Peoples R China

来源：

APPLIED INTELLIGENCE | 2025年 / 55卷 / 06期

关键词：

Anomalous sound detection; Audio signal processing; Self-supervised learning; Acoustic feature extraction; Domain shift;

D O I：

10.1007/s10489-025-06366-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Anomalous Sound Detection (ASD) has research significance and application prospect industrial automation. Most existing models of ASD have limited ability to effectively utilize machine sound features, leading to reduced stability against sound anomalies and domain shift variations. To address the above issues, we propose a self-supervised ASD model based on spectrogram decomposition and parallel sub-network in this paper. Firstly, we decompose the spectrogram along the time and frequency dimensions to balance feature size and information integrity. This approach emphasizes the temporal and frequency variations in the feature map, facilitating a better understanding of the factors that affect machine sounds under domain shift conditions. Secondly, we design a pair of parallel training sub-networks. The parallel sub-networks employ self-attention mechanisms and shared gradients to effectively capture changes in features across both time and frequency dimensions. This approach improves model stability against anomalies and domain shifts. Finally, the anomaly scores of sub-network branches are fused as anomalous detection results. The performance of the proposed model is validated on DCASE2022 Task2 dataset. The Area under the Receiver Operating Characteristic Curve (AUC) and partial AUC (pAUC) of our model reached 72.89% and 64.83%. The results confirm the effectiveness of the proposed model, achieving better performance.

引用

页数：18

共 46 条

[41] Attention-Based Spatial and Spectral Network with PCA-Guided Self-Supervised Feature Extraction for Change Detection in Hyperspectral Images
Wang, Zhao
Jiang, Fenlong
Liu, Tongfei
Xie, Fei
Li, Peng
REMOTE SENSING, 2021, 13 (23)
[42] Formulating Parallel Supervised Machine Learning Designs For Anomaly-Based Network Intrusion Detection in Resource Constrained Use Cases
Joshi, Varun
Korah, John
2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 748 - 753
[43] CMR Quantification of Myocardial Oxygenation Extraction Fraction Maps Using a Model-based Self-supervised Learning Network: Initial Experience in Patients with Cardiomyopathies
Huang, Qi
Tang, Haoteng
Wang, Keyang
Li, Ran
Zhan, Marcus
Yang, Yang
Woodard, Pamela
Zheng, Jie
CIRCULATION, 2024, 150
[44] Performance analysis of weakly-supervised sound event detection system based on the mean-teacher convolutional recurrent neural network model
Lee, Seokjin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (02): : 139 - 147
[45] Unsupervised Noise-Resistant Remote-Sensing Image Change Detection: A Self-Supervised Denoising Network-, FCM_SICM-, and EMD Metric-Based Approach
Xie, Jiangling
Li, Yikun
Yang, Shuwen
Li, Xiaojun
REMOTE SENSING, 2024, 16 (17)
[46] A semi-supervised Anti-Fraud model based on integrated XGBoost and BiGRU with self-attention network: an application to internet loan fraud detection
Gorle, Venkata Lakshmi Narayana
Panigrahi, Suvasini
CURRENT SCIENCE, 2024, 126 (02): : 56939 - 56964

← 1 2 3 4 5 →