Estimating Rainfall from Surveillance Audio Based on Parallel Network with Multi-Scale Fusion and Attention Mechanism

Cited by: 6
Authors
Chen, Mingzheng [1, 2, 3]
Wang, Xing [1, 2, 3, 4]
Wang, Meizhen [1, 2, 3]
Liu, Xuejun [1, 2, 3]
Wu, Yong [5]
Wang, Xiaochu [1, 2, 3]
Affiliations
[1] Nanjing Normal Univ, Minist Educ, Key Lab Virtual Geog Environm, Nanjing 210023, Peoples R China
[2] State Key Lab Cultivat Base Geog Environm Evolut, Nanjing 210023, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China
[4] Univ Vienna, Dept Geog & Reg Res, A-1010 Vienna, Austria
[5] Fujian Normal Univ, Inst Geog, Fuzhou 350000, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
rainfall estimation; surveillance audio; machine learning; multi-scale fusion; classification; recognition; resolution
DOI
10.3390/rs14225750
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Rainfall data are of profound significance for meteorology, climatology, hydrology, and the environmental sciences. However, existing rainfall observation methods (ground-based rain gauges and radar- or satellite-based remote sensing) offer limited spatiotemporal resolution and cannot meet the needs of high-resolution application scenarios such as urban waterlogging and emergency rescue. Existing studies have treated widespread surveillance cameras as alternative rain gauges. Because these cameras operate continuously and record the acoustic signals of rainfall, surveillance audio should be considered a data source for obtaining high-resolution, all-weather rainfall data. In this study, a method named parallel neural network based on attention mechanisms and multi-scale fusion (PNNAMMS) is proposed for automatically classifying rainfall levels from surveillance audio. The proposed model employs a parallel dual-channel network: a spatial channel extracts the frequency-domain correlation, and a temporal channel captures the time-domain continuity of the rainfall sound. Attention mechanisms are applied to both channels to highlight significant spatiotemporal elements. A multi-scale fusion method is adopted to fuse features of different scales in the spatial channel for more robust performance in complex surveillance scenarios. Experiments showed that our method achieved an estimation accuracy of 84.64% for rainfall levels and outperformed previously proposed methods.
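The dual-channel idea in the abstract can be illustrated with a toy forward pass. This is a minimal sketch and not the PNNAMMS implementation: the abstract does not specify layer types, sizes, or the number of rainfall levels, so the function names, the random weights, the pooling scales (1, 2, 4), and the five output levels are all assumptions made for illustration. Learned network layers are replaced by simple pooling, and attention is applied only in the temporal channel to keep the example short.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_attention(frames, w):
    """Temporal channel (sketch): softmax-weighted sum of time frames."""
    # One scalar score per frame, from a hypothetical weight vector w.
    scores = [sum(wi * xi for wi, xi in zip(w, frame)) for frame in frames]
    alpha = softmax(scores)
    n_bins = len(frames[0])
    return [sum(a * frame[k] for a, frame in zip(alpha, frames))
            for k in range(n_bins)]


def multi_scale_fusion(frames, scales=(1, 2, 4)):
    """Spatial channel (sketch): pool the mean spectrum at several scales."""
    n_frames = len(frames)
    mean_spec = [sum(frame[k] for frame in frames) / n_frames
                 for k in range(len(frames[0]))]
    fused = []
    for s in scales:
        # Non-overlapping average pooling along the frequency axis.
        for start in range(0, len(mean_spec), s):
            window = mean_spec[start:start + s]
            fused.append(sum(window) / len(window))
    return fused  # concatenation of all scales


def classify(frames, att_w, out_w):
    """Concatenate both channels and map to rainfall-level probabilities."""
    features = temporal_attention(frames, att_w) + multi_scale_fusion(frames)
    logits = [sum(wi * xi for wi, xi in zip(row, features)) for row in out_w]
    return softmax(logits)


# Hypothetical sizes: 6 time frames, 8 frequency bins, 5 rainfall levels.
rng = random.Random(0)
N_FRAMES, N_BINS, N_LEVELS = 6, 8, 5
frames = [[rng.random() for _ in range(N_BINS)] for _ in range(N_FRAMES)]
att_w = [rng.uniform(-1, 1) for _ in range(N_BINS)]
n_features = N_BINS + sum(math.ceil(N_BINS / s) for s in (1, 2, 4))
out_w = [[rng.uniform(-1, 1) for _ in range(n_features)]
         for _ in range(N_LEVELS)]
probs = classify(frames, att_w, out_w)  # one probability per rainfall level
```

Here `frames` stands in for a spectrogram of a short audio clip (time frames by frequency bins). In a trained model the weights would be learned from labeled surveillance audio rather than drawn at random.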
Pages: 17