Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting

被引：17

作者：

Wang, Xin ^{[1
,2
]}

Lv, Rongrong ^{[1
]}

Zhao, Yang ^{[2
]}

Yang, Tangwen ^{[1
]}

Ruan, Qiuqi ^{[1
,2
]}

机构：

[1] Beijing Jiaotong Univ, Inst Informat Sci, Sch Comp & Informat Technol, Beijing 100044, Peoples R China

[2] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Guangdong, Peoples R China

来源：

PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020) | 2020年

关键词：

dense context-aware module; hierarchical attention guided; multi-scale extraction; crowd counting;

D O I：

10.1109/ICSP48669.2020.9321067

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Crowd counting aims to predict the number of people and generate the density map in the image. There are many challenges, including varying head scales, the diversity of crowd distribution across images and cluttered backgrounds. In this paper, we propose a multi-scale context aggregation network (MSCANet) based on single column encoder-decoder architecture for crowd counting, which consists of an encoder based on dense context-aware module (DCAM) and a hierarchical attention-guided decoder. To handle the issue of scale variation, we construct the DCAM to aggregate multi-scale contextual information through densely connecting the dilated convolution with varying receptive fields. The proposed DCAM can capture rich contextual information of crowd areas due to its long-range receptive fields and dense scale sampling. Moreover, to suppress the background noise and generate a high-quality density map, we adopt a hierarchical attention-guided mechanism in the decoder. This helps to integrate more useful spatial information from shallow feature maps of the encoder by introducing multiple supervision based on semantic attention module (SAM). Extensive experiments demonstrate that the proposed approach achieves better performance than other similar state-of-the-art methods on three challenging benchmark datasets for crowd counting.

引用

页码：240 / 245

页数：6

共 50 条

[31] Compare and Focus: Multi-Scale View Aggregation for Crowd Counting
Jiang, Shengqin
Cai, Jialu
Zhang, Haokui
Liu, Yu
Liu, Qingshan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13231 - 13239
[32] Redesigning Multi-Scale Neural Network for Crowd Counting
Du, Zhipeng
Shi, Miaojing
Deng, Jiankang
Zafeiriou, Stefanos
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3664 - 3678
[33] An encoder-decoder network for crowd counting based on multi-scale attention mechanism
Chuang H.-H.
Chen Y.-C.
Lin C.H.
Multimedia Tools and Applications, 2025, 84 (03) : 1187 - 1210
[34] GLIMS: Attention-guided lightweight multi-scale hybrid network for volumetric semantic segmentation
Yazici, Ziya Ata
Oksuz, Ilkay
Ekenel, Hazim Kemal
IMAGE AND VISION COMPUTING, 2024, 146
[35] Multi-Scale and spatial position-based channel attention network for crowd counting
Wang, Lin
Li, Jie
Zhang, Siqi
Qi, Chun
Wang, Pan
Wang, Fengping
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
[36] Dense Dilated Multi-Scale Supervised Attention-Guided Network for histopathology image segmentation
Das, Rangan
Bose, Shirsha
Chowdhury, Ritesh Sur
Maulik, Ujjwal
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
[37] Attention-guided multi-scale learning network for automatic prostate and tumor segmentation on MRI
Li, Yuchun
Wu, Yuanyuan
Huang, Mengxing
Zhang, Yu
Bai, Zhiming
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
[38] ADMNet: Attention-Guided Densely Multi-Scale Network for Lightweight Salient Object Detection
Zhou, Xiaofei
Shen, Kunye
Liu, Zhi
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10828 - 10841
[39] Crowd Counting Guided by Attention Network
Nie, Pei
Fan, Cien
Zou, Lian
Chen, Liqiong
Li, Xiaopeng
INFORMATION, 2020, 11 (12) : 1 - 10
[40] Multi-Scale Attention-Guided Non-Local Network for HDR Image Reconstruction
Yoon, Howoon
Uddin, S. M. Nadim
Jung, Yong Ju
SENSORS, 2022, 22 (18)

← 1 2 3 4 5 →