Estimating Rainfall from Surveillance Audio Based on Parallel Network with Multi-Scale Fusion and Attention Mechanism

Cited by: 6

Authors:

Chen, Mingzheng [1,2,3]

Wang, Xing [1,2,3,4]

Wang, Meizhen [1,2,3]

Liu, Xuejun [1,2,3]

Wu, Yong [5]

Wang, Xiaochu [1,2,3]

Affiliations:

[1] Nanjing Normal Univ, Minist Educ, Key Lab Virtual Geog Environm, Nanjing 210023, Peoples R China

[2] State Key Lab Cultivat Base Geog Environm Evolut, Nanjing 210023, Peoples R China

[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China

[4] Univ Vienna, Dept Geog & Reg Res, A-1010 Vienna, Austria

[5] Fujian Normal Univ, Inst Geog, Fuzhou 350000, Peoples R China

Source:

REMOTE SENSING | 2022 / Volume 14 / Issue 22

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

Keywords:

rainfall estimation; surveillance audio; machine learning; multi-scale fusion; CLASSIFICATION; RECOGNITION; RESOLUTION;

DOI:

10.3390/rs14225750

Chinese Library Classification number:

X [Environmental Science, Safety Science];

Discipline classification code:

08 ; 0830 ;

Abstract:

Rainfall data are of profound significance for meteorology, climatology, hydrology, and environmental sciences. However, existing rainfall observation methods (including ground-based rain gauges and radar-/satellite-based remote sensing) offer limited spatiotemporal resolution and cannot meet the needs of high-resolution application scenarios (urban waterlogging, emergency rescue, etc.). Widespread surveillance cameras have been regarded as alternative rain gauges in existing studies. Because these cameras run nonstop and record the acoustic signal of rainfall, surveillance audio should be considered a data source for obtaining high-resolution, all-weather rainfall data. In this study, a method named parallel neural network based on attention mechanisms and multi-scale fusion (PNNAMMS) is proposed for automatically classifying rainfall levels from surveillance audio. The proposed model employs a parallel dual-channel network in which a spatial channel extracts the frequency-domain correlation and a temporal channel captures the time-domain continuity of the rainfall sound. Additionally, attention mechanisms are applied to both channels to obtain significant spatiotemporal elements. A multi-scale fusion method is adopted to fuse features of different scales in the spatial channel for more robust performance in complex surveillance scenarios. Experiments showed that the method achieved an estimation accuracy of 84.64% for rainfall levels and outperformed previously proposed methods.
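The data flow described in the abstract can be illustrated with a minimal, untrained sketch. It is not the authors' PNNAMMS implementation: the function names, the pooling scales (1, 2, 4), and the use of mean energy as an attention score are all assumptions made for illustration, standing in for layers and weights that the real model would learn. The input is assumed to be a spectrogram stored as a list of time frames, each a list of frequency-bin magnitudes.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def avg_pool(values, k):
    """Non-overlapping average pooling with window k (tail remainder dropped)."""
    return [sum(values[i:i + k]) / k for i in range(0, len(values) - k + 1, k)]

def temporal_channel(spec):
    """Attention over time frames: weight each frame, then sum to one vector."""
    # Assumption: mean frame energy stands in for a learned attention score.
    weights = softmax([sum(frame) / len(frame) for frame in spec])
    n_bins = len(spec[0])
    return [sum(w * frame[f] for w, frame in zip(weights, spec))
            for f in range(n_bins)]

def spatial_channel(spec, scales=(1, 2, 4)):
    """Multi-scale fusion along frequency: pool at several scales, concatenate."""
    n_frames = len(spec)
    n_bins = len(spec[0])
    mean_spectrum = [sum(frame[f] for frame in spec) / n_frames
                     for f in range(n_bins)]
    fused = []
    for k in scales:  # hypothetical scales, not taken from the paper
        fused.extend(avg_pool(mean_spectrum, k))
    # Attention over the fused features (again a stand-in for learned weights).
    weights = softmax(fused)
    return [w * x for w, x in zip(weights, fused)]

def parallel_features(spec):
    """Run both channels on the same input and concatenate their outputs."""
    return spatial_channel(spec) + temporal_channel(spec)

if __name__ == "__main__":
    # Toy spectrogram: 3 time frames x 4 frequency bins.
    toy = [[0.1, 0.4, 0.3, 0.2],
           [0.2, 0.5, 0.1, 0.2],
           [0.0, 0.6, 0.3, 0.1]]
    print(len(parallel_features(toy)))
```

In a trained model, the concatenated feature vector would feed a classifier that outputs a rainfall level; that step is omitted here because the abstract does not specify it.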


Pages: 17
