Anomalous sound detection for machine condition monitoring using 3D tensor representation of sound and 3D deep convolutional neural network

被引:0
|
作者
Mohsen Khanjari
Azita Azarfar
Mohamad Hosseini Abardeh
Esmail Alibeiki
机构
[1] Islamic Azad University,Department of Electrical and Computer Engineering, Shahrood Branch
来源
关键词
Anomalous sound detection; 3D tensor representation; 3D deep convolutional neural network;
D O I
暂无
中图分类号
学科分类号
摘要
This study introduces a novel approach that utilizes a three-dimensional tensor representation of machine-generated audio signals, serving as a suitable input for a three-dimensional convolutional neural network. The proposed method involves calculating the reconstructed phase space of the audio signal, followed by converting the resulting three-dimensional reconstructed phase space into a three-dimensional tensor format. This technique offers superiority by capturing nonlinear dynamic features and uncovering hidden system variables, which can improve discrimination and classification, enabling accurate detection of anomalous sound patterns, with valuable information encoded in the shape of the data cloud within the tensors. Subsequently, these tensors are employed as input to a three-dimensional deep convolutional neural network, facilitating effective analysis and classification of the audio signals. To assess the effectiveness of the proposed method, we conduct a comprehensive evaluation on three benchmark datasets: MFPT, MIMII, and ToyADAMOS, employing a 5-fold cross-validation scheme. The evaluation metrics employed include Sensitivity, Specificity, Accuracy, and F1 Score to ensure a thorough examination of the method's performance across diverse datasets, encompassing different machine types and acoustic environments. The experimental results showed a high average accuracy of 97.63% on the MFPT dataset. However, in the MIMII dataset, the slider machinery achieved the highest average accuracy rate of 92.02%, while the pump machinery had the lowest average accuracy rate of 90.54%. For the ToyADAMOS dataset, an average accuracy rate of approximately 94% was obtained. These findings underscore the method's potential for accurately detecting anomalies across various machine types and acoustic environments.
引用
收藏
页码:44101 / 44119
页数:18
相关论文
共 50 条
  • [42] 3D Sketch-based 3D Model Retrieval with Convolutional Neural Network
    Ye, Yuxiang
    Li, Bo
    Lu, Yijuan
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2936 - 2941
  • [43] 3D Convolutional Neural Network for Action Recognition
    Zhang, Junhui
    Chen, Li
    Tian, Jing
    COMPUTER VISION, PT I, 2017, 771 : 600 - 607
  • [44] 3D SOUND FOR CONCERT HALL
    del Cerro, Emiliano
    Ma Ortiz, Silvia
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONGRESS ON SOUND AND VIBRATION: MAJOR CHALLENGES IN ACOUSTICS, NOISE AND VIBRATION RESEARCH, 2015, 2015,
  • [45] Embedding sound in 3D graphics
    Akhan, MB
    Bahari, EG
    IBC - INTERNATIONAL BROADCASTING CONVENTION, 1997, (447): : 278 - 283
  • [46] SONOLITHOGRAPHY: 3D PRINTING WITH SOUND
    不详
    ADVANCED MATERIALS & PROCESSES, 2021, 179 (04): : 80 - 80
  • [47] 3D image processing using deep neural network
    Fujii, Toshiaki
    THREE-DIMENSIONAL IMAGING, VISUALIZATION, AND DISPLAY 2019, 2019, 10997
  • [48] Multiprocessor 3D sound system
    El-Sharkawy, M
    Guillen, N
    Eshmawy, W
    Langhorst, B
    Gundrum, H
    Judd, D
    Auerbach, R
    40TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 1998, : 798 - 801
  • [49] Simulation of 3D moving sound
    QIU Yujing
    Jenison R L
    Hu Y H
    Reate R A
    Brugge J F (Departments of Electrical and Computer Engineering
    ChineseJournalofAcoustics, 1999, (02) : 146 - 151
  • [50] Beam Search for Learning a Deep Convolutional Neural Network of 3D Shapes
    Xu, Xu
    Todorovic, Sinisa
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3506 - 3511