Anomalous sound detection for machine condition monitoring using 3D tensor representation of sound and 3D deep convolutional neural network

被引：0

作者：

Mohsen Khanjari

Azita Azarfar

Mohamad Hosseini Abardeh

Esmail Alibeiki

机构：

[1] Islamic Azad University,Department of Electrical and Computer Engineering, Shahrood Branch

来源：

Multimedia Tools and Applications | 2024年 / 83卷

关键词：

Anomalous sound detection; 3D tensor representation; 3D deep convolutional neural network;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This study introduces a novel approach that utilizes a three-dimensional tensor representation of machine-generated audio signals, serving as a suitable input for a three-dimensional convolutional neural network. The proposed method involves calculating the reconstructed phase space of the audio signal, followed by converting the resulting three-dimensional reconstructed phase space into a three-dimensional tensor format. This technique offers superiority by capturing nonlinear dynamic features and uncovering hidden system variables, which can improve discrimination and classification, enabling accurate detection of anomalous sound patterns, with valuable information encoded in the shape of the data cloud within the tensors. Subsequently, these tensors are employed as input to a three-dimensional deep convolutional neural network, facilitating effective analysis and classification of the audio signals. To assess the effectiveness of the proposed method, we conduct a comprehensive evaluation on three benchmark datasets: MFPT, MIMII, and ToyADAMOS, employing a 5-fold cross-validation scheme. The evaluation metrics employed include Sensitivity, Specificity, Accuracy, and F1 Score to ensure a thorough examination of the method's performance across diverse datasets, encompassing different machine types and acoustic environments. The experimental results showed a high average accuracy of 97.63% on the MFPT dataset. However, in the MIMII dataset, the slider machinery achieved the highest average accuracy rate of 92.02%, while the pump machinery had the lowest average accuracy rate of 90.54%. For the ToyADAMOS dataset, an average accuracy rate of approximately 94% was obtained. These findings underscore the method's potential for accurately detecting anomalies across various machine types and acoustic environments.

引用

页码：44101 / 44119

页数：18

共 50 条

[41] Machine condition monitoring: Standardization of the 3D print
Konstruktion, 2021, 2021 (10):
[42] 3D Sketch-based 3D Model Retrieval with Convolutional Neural Network
Ye, Yuxiang
Li, Bo
Lu, Yijuan
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2936 - 2941
[43] 3D Convolutional Neural Network for Action Recognition
Zhang, Junhui
Chen, Li
Tian, Jing
COMPUTER VISION, PT I, 2017, 771 : 600 - 607
[44] 3D SOUND FOR CONCERT HALL
del Cerro, Emiliano
Ma Ortiz, Silvia
PROCEEDINGS OF THE 22ND INTERNATIONAL CONGRESS ON SOUND AND VIBRATION: MAJOR CHALLENGES IN ACOUSTICS, NOISE AND VIBRATION RESEARCH, 2015, 2015,
[45] Embedding sound in 3D graphics
Akhan, MB
Bahari, EG
IBC - INTERNATIONAL BROADCASTING CONVENTION, 1997, (447): : 278 - 283
[46] SONOLITHOGRAPHY: 3D PRINTING WITH SOUND
不详
ADVANCED MATERIALS & PROCESSES, 2021, 179 (04): : 80 - 80
[47] 3D image processing using deep neural network
Fujii, Toshiaki
THREE-DIMENSIONAL IMAGING, VISUALIZATION, AND DISPLAY 2019, 2019, 10997
[48] Multiprocessor 3D sound system
El-Sharkawy, M
Guillen, N
Eshmawy, W
Langhorst, B
Gundrum, H
Judd, D
Auerbach, R
40TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 1998, : 798 - 801
[49] Simulation of 3D moving sound
QIU Yujing
Jenison R L
Hu Y H
Reate R A
Brugge J F (Departments of Electrical and Computer Engineering
ChineseJournalofAcoustics, 1999, (02) : 146 - 151
[50] Beam Search for Learning a Deep Convolutional Neural Network of 3D Shapes
Xu, Xu
Todorovic, Sinisa
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3506 - 3511

← 1 2 3 4 5 →