A self-supervised anomalous machine sound detection model based on spectrogram decomposition and parallel sub-network

Cited by: 0

Authors:

Zhang, Tao [1]

Kong, Lingguo [1]

Zhao, Xin [1]

Li, Donglei [1]

Geng, Yanzhang [1]

Ding, Biyun [2]

Wang, Chao [1]

Affiliations:

[1] Tianjin Univ, Sch Elect Informat Engn, 92 Weijin Rd, Tianjin 300072, Peoples R China

[2] Nanchang Hangkong Univ, Sch Informat Engn, 696 Fenghe South Ave, Nanchang 330063, Jiangxi, Peoples R China

Source:

APPLIED INTELLIGENCE | 2025 / Vol. 55 / Issue 06

Keywords:

Anomalous sound detection; Audio signal processing; Self-supervised learning; Acoustic feature extraction; Domain shift;

DOI:

10.1007/s10489-025-06366-9

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Anomalous Sound Detection (ASD) has research significance and application prospects in industrial automation. Most existing ASD models make limited use of machine sound features, which reduces their stability against sound anomalies and domain shift. To address these issues, we propose a self-supervised ASD model based on spectrogram decomposition and parallel sub-networks. First, we decompose the spectrogram along the time and frequency dimensions to balance feature size and information integrity. This decomposition emphasizes the temporal and frequency variations in the feature map, facilitating a better understanding of the factors that affect machine sounds under domain shift. Second, we design a pair of sub-networks that are trained in parallel. The sub-networks employ self-attention mechanisms and shared gradients to capture feature changes across both the time and frequency dimensions, which improves model stability against anomalies and domain shift. Finally, the anomaly scores of the sub-network branches are fused to produce the anomaly detection result. The proposed model is validated on the DCASE2022 Task2 dataset, where its Area under the Receiver Operating Characteristic Curve (AUC) and partial AUC (pAUC) reached 72.89% and 64.83%, respectively. The results confirm the effectiveness of the proposed model and its improved performance.
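The abstract does not specify how the spectrogram is split or how the branch scores are fused, so the following is only a minimal sketch of the general idea under stated assumptions: the spectrogram is a plain list of frequency rows, it is cut into equal-sized blocks along each axis (leftover bins or frames are dropped), and the two branch scores are combined by a weighted average. The function names `split_frequency`, `split_time`, and `fuse_scores`, and the `weight` parameter, are hypothetical and are not taken from the paper.

```python
def split_frequency(spec, n_bands):
    """Split a spectrogram into n_bands equal blocks along the frequency axis.

    spec is a list of frequency rows, each row a list of time frames.
    Assumption: leftover rows that do not fill a whole band are dropped.
    """
    size = len(spec) // n_bands
    return [spec[i * size:(i + 1) * size] for i in range(n_bands)]


def split_time(spec, n_segments):
    """Split a spectrogram into n_segments equal blocks along the time axis.

    Assumption: leftover frames that do not fill a whole segment are dropped.
    """
    size = len(spec[0]) // n_segments
    return [[row[i * size:(i + 1) * size] for row in spec]
            for i in range(n_segments)]


def fuse_scores(time_score, freq_score, weight=0.5):
    """Combine two branch anomaly scores by a weighted average.

    Assumption: a weighted mean stands in for whatever fusion rule
    the paper actually uses.
    """
    return weight * time_score + (1.0 - weight) * freq_score


# Toy spectrogram: 4 frequency bins x 8 time frames.
spec = [[f * 10 + t for t in range(8)] for f in range(4)]
bands = split_frequency(spec, 2)   # 2 bands of 2 bins x 8 frames
segments = split_time(spec, 4)     # 4 segments of 4 bins x 2 frames
fused = fuse_scores(0.25, 0.75)    # equal-weight fusion of two scores
```

In a real pipeline each band or segment would be fed to its own sub-network branch; here the pieces are only produced to show how the two decompositions differ in shape.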

Pages: 18

46 entries in total

[1] Machine Anomalous Sound Detection Based on Self-Supervised Classification
Wang, Shuxian
Du, Jun
Wang, Yajian
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 449 - 454
[2] SELF-SUPERVISED LEARNING FOR ANOMALOUS SOUND DETECTION
Wilkinghoff, Kevin
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 276 - 280
[3] Self-supervised Complex Network for Machine Sound Anomaly Detection
Kim, Miseul
Minh Tri Ho
Kang, Hong-Goo
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 586 - 590
[4] ASDNet: An Efficient Self-Supervised Convolutional Network for Anomalous Sound Detection
Kong, Dewei
Yuan, Guoshun
Yu, Hongjiang
Wang, Shuai
Zhang, Bo
APPLIED SCIENCES-BASEL, 2025, 15 (02):
[5] FLOW-BASED SELF-SUPERVISED DENSITY ESTIMATION FOR ANOMALOUS SOUND DETECTION
Dohi, Kota
Endo, Takashi
Purohit, Harsh
Tanabe, Ryo
Kawaguchi, Yohei
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 336 - 340
[6] SSDPT: Self-supervised dual-path transformer for anomalous sound detection
Bai, Jisheng
Chen, Jianfeng
Wang, Mou
Ayub, Muhammad Saad
Yan, Qingli
DIGITAL SIGNAL PROCESSING, 2023, 135
[7] Anomalous Sub-Trajectory Detection With Graph Contrastive Self-Supervised Learning
Kong, Xiangjie
Lin, Hang
Jiang, Renhe
Shen, Guojiang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (07) : 9800 - 9811
[8] Anomalous Sound Detection Using Self-Supervised Classification Deep Hierarchical Reconstruction Network with Symmetric Fusion Attention
Wang, Hui
Shen, Kuan
Wang, Fuquan
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025,
[9] SELF-SUPERVISED REPRESENTATION LEARNING FOR UNSUPERVISED ANOMALOUS SOUND DETECTION UNDER DOMAIN SHIFT
Chen, Han
Song, Yan
Dai, Li-Rong
McLoughlin, Ian
Liu, Lin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 471 - 475
[10] Network Intrusion Detection Model Based on Improved BYOL Self-Supervised Learning
Wang, Zhendong
Li, Zeyu
Wang, Junling
Li, Dahai
SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
