ConvLSTM-based Sound Source Localization in a manufacturing workplace

被引:1
|
作者
Jalayer, Reza [1 ]
Jalayer, Masoud [1 ]
Mor, Andrea [1 ]
Orsenigo, Carlotta [1 ]
Vercellis, Carlo [1 ]
机构
[1] Politecn Milan, Dept Management Econ & Ind Engn, Via Lambruschini 4-b, I-20156 Milan, Italy
关键词
Industry; 5.0; Smart manufacturing; Sound source localization; Convolutional LSTM; Multiple sound sources; Moving sound sources; DOA ESTIMATION; CRNN;
D O I
10.1016/j.cie.2024.110213
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, Sound Source Localization (SSL) is explored as an approach to localize both human operators and machines emitting sound signals in a manufacturing workplace. In particular, a comprehensive analysis of the source localization ability of a state-of-the-art deep learning architecture in environments of increasing complexity is presented. Scenarios including single, dual, and multiple sound sources, in the form of both human and Computerized Numerical Control (CNC) machines, are investigated, as well as configurations with a mix of stationary and moving sources. Our work contributes to the extant literature by enriching previous research findings primarily devoted to single stationary sources. Furthermore, by focusing on the simultaneous and centralized detection of sources of different nature and type, it diverges from traditional SSL studies in manufacturing, which emphasize the localization of humans by robots in human-robot interaction, and presents a localization approach which enables a broader control over the workspace. For the localization task, a Convolutional LSTM architecture able to capture both spatial and temporal sound characteristics is also proposed, with each source assigned a dedicated model. Extensive experiments were carried out for each scenario in a simulated environment, where different levels of noise were also applied. The results showed the remarkable accuracy and robustness of the deep learning models when it comes to localizing single and dual stationary sources, as well as single moving sources. For multiple stationary and moving sources a general decline in the detection performance was observed, alongside a heightened sensitivity to noise.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet
    Guan, Duanzheng
    Li, Dengshi
    Cai, Xuebei
    Wang, Xiaochen
    Hu, Ruimin
    MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 189 - 200
  • [42] An overview of sound source localization based condition monitoring robots
    Lv, Dong
    Tang, Weijie
    Feng, Guojin
    Zhen, Dong
    Gu, Fengshou
    Ball, Andrew D.
    ISA TRANSACTIONS, 2025, 158 : 537 - 555
  • [43] Binaural Sound Source Localization Based on Convolutional Neural Network
    Zhou, Lin
    Ma, Kangyu
    Wang, Lijie
    Chen, Ying
    Tang, Yibin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 60 (02): : 545 - 557
  • [44] Sound Source Localization Based on B-Format Signals
    Khaddour, Hasan
    2011 34TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2011, : 335 - 338
  • [45] Sound source localization technology based on small microphone array
    Engineering Laboratory for Key Technology and System Integration of Internet of Things, Harbin Institute of Technology Shenzhen Graduate School, Guangdong, Shenzhen, 518055, China
    不详
    Huazhong Ligong Daxue Xuebao, SUPPL.I (188-191):
  • [46] Binaural sound source localization based on weighted template matching
    Liu, Hong
    Sun, Yongheng
    Yang, Ge
    Chen, Yang
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2021, 6 (02) : 214 - 223
  • [47] Sound source localization in real sound fields based on empirical statistics of interaural parameters
    Nix, Johannes
    Hohmann, Volker
    1600, Acoustical Society of America (119):
  • [48] A new sound source location algorithm based on formant frequency for sound image localization
    Obata, K
    Noguchi, K
    Tadokoro, Y
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 729 - 732
  • [49] Sound source localization in real sound fields based on empirical statistics of interaural parameters
    Nix, J
    Hohmann, V
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (01): : 463 - 479
  • [50] Influence of sound source width on human sound localization
    Greene, Nathaniel T.
    Paige, Gary D.
    2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 6455 - 6458