Deep Learning-Based Portable Device for Audio Distress Signal Recognition in Urban Areas

被引:6
|
作者
Felipe Gaviria, Jorge [1 ]
Escalante-Perez, Alejandra [1 ]
Camilo Castiblanco, Juan [1 ]
Vergara, Nicolas [1 ]
Parra-Garces, Valentina [1 ]
David Serrano, Juan [1 ]
Felipe Zambrano, Andres [1 ]
Felipe Giraldo, Luis [1 ]
机构
[1] Univ Los Andes, Dept Elect & Elect Engn, Bogota 111711, DC, Colombia
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 21期
关键词
acoustic signal processing; smart cities; convolutional neural network; raspberry Pi; deep learning;
D O I
10.3390/app10217448
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Real-time automatic identification of audio distress signals in urban areas is a task that in a smart city can improve response times in emergency alert systems. The main challenge in this problem lies in finding a model that is able to accurately recognize these type of signals in the presence of background noise and allows for real-time processing. In this paper, we present the design of a portable and low-cost device for accurate audio distress signal recognition in real urban scenarios based on deep learning models. As real audio distress recordings in urban areas have not been collected and made publicly available so far, we first constructed a database where audios were recorded in urban areas using a low-cost microphone. Using this database, we trained a deep multi-headed 2D convolutional neural network that processed temporal and frequency features to accurately recognize audio distress signals in noisy environments with a significant performance improvement to other methods from the literature. Then, we deployed and assessed the trained convolutional neural network model on a Raspberry Pi that, along with the low-cost microphone, constituted a device for accurate real-time audio recognition. Source code and database are publicly available. Dataset: https://github.com/jfgf11/Problema-Especial.git
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [1] Audio Distress Signal Recognition in Rural and Urban Areas using a WSN consisting of Portable Resource-Constrained Devices
    Ahmad, Syed Farhan
    Jha, Utkarsh
    Rawat, Raghav
    Govindaraju, M.
    2021 28TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (IEEE ICECS 2021), 2021,
  • [2] Deep Learning-Based Audio-Visual Speech Recognition for Bosnian Digits
    Fazlic, Husein
    Abd Almisre, Ali
    Tahir, Nooritawati Md
    JURNAL KEJURUTERAAN, 2024, 36 (01): : 147 - 154
  • [3] Automatic fabric pattern recognition and design based on deep learning and portable device
    Zhou, Xianke
    Li, Hang
    Zhang, Dejun
    INTERNET TECHNOLOGY LETTERS, 2023, 6 (05)
  • [4] Deep Learning and Audio Based Emotion Recognition
    Demir, Asli
    Atila, Orhan
    Sengur, Abdulkadir
    2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [5] Deep learning-based framework for expansion, recognition and classification of underwater acoustic signal
    Jin, Guanghao
    Liu, Fan
    Wu, Hao
    Song, Qingzeng
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2020, 32 (02) : 205 - 218
  • [6] Federated deep reinforcement learning-based urban traffic signal optimal control
    Li, Mi
    Pan, Xiaolong
    Liu, Chuhui
    Li, Zirui
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [7] Audio Recording Device Identification Based on Deep Learning
    Qi, Simeng
    Huang, Zheng
    Li, Yan
    Shi, Shaopei
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 426 - 431
  • [8] A Novel Deep Learning-based Prediction Approach for Groundwater Salinity Assessment of Urban Areas
    Abbasimaedeh, Pouyan
    Ferdosian, Nasim
    POLLUTION, 2023, 9 (02): : 712 - 725
  • [9] Deep learning-based intelligent detection of pavement distress
    Zheng, Lele
    Xiao, Jingjing
    Wang, Yinghui
    Wu, Wangjie
    Chen, Zhirong
    Yuan, Dongdong
    Jiang, Wei
    AUTOMATION IN CONSTRUCTION, 2024, 168
  • [10] Deep Learning-Based Model for Financial Distress Prediction
    Elhoseny, Mohamed
    Metawa, Noura
    Sztano, Gabor
    El-hasnony, Ibrahim M.
    ANNALS OF OPERATIONS RESEARCH, 2025, 345 (2-3) : 885 - 907