Hybrid attention network based on progressive embedding scale-context for crowd counting

被引:21
|
作者
Wang, Fusen [1 ,2 ]
Sang, Jun [1 ,2 ]
Wu, Zhongyuan [1 ,2 ]
Liu, Qi [1 ,2 ]
Sang, Nong [3 ]
机构
[1] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430000, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowd counting; Hybrid attention; Progressive embedding scale-context; Density map estimation;
D O I
10.1016/j.ins.2022.01.046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing crowd counting methods usually adopt attention mechanisms to tackle background noise, or apply multilevel features or multiscale context fusion to tackle scale variation. However, these approaches deal with these two problems separately. In this paper, we propose a hybrid attention network (HAN) by employing progressive embedding scale context (PES) information, which enables the network to simultaneously suppress noise and adapt head scale variation. We build the hybrid attention mechanism through two parallel spatial attention and channel attention modules, which makes the network focus more on the human head area and reduce the interference of background objects. In addition, we embed certain scale-context to the hybrid attention along the spatial and channel dimensions to alleviate the counting errors caused by the variation of perspective and head scale. Finally, we propose a progressive learning strategy through cascading multiple hybrid attention modules with embedding different scale contexts, which can gradually integrate different scale-context information into the current feature map from global to local. Ablation experiments show that the network architecture can gradually learn multi scale features and suppress background noise. Extensive experiments demonstrate that HANet obtains state-of-the-art counting performance on five mainstream datasets.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:306 / 318
页数:13
相关论文
共 50 条
  • [21] An encoder-decoder network for crowd counting based on multi-scale attention mechanism
    Chuang H.-H.
    Chen Y.-C.
    Lin C.H.
    Multimedia Tools and Applications, 2025, 84 (03) : 1187 - 1210
  • [22] Aggregated context network for crowd counting
    Si-yue Yu
    Jian Pu
    Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 1626 - 1638
  • [23] Shallow Feature Based Dense Attention Network for Crowd Counting
    Miao, Yunqi
    Lin, Zijia
    Ding, Guiguang
    Han, Jungong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11765 - 11772
  • [24] Multi-Scale and spatial position-based channel attention network for crowd counting
    Wang, Lin
    Li, Jie
    Zhang, Siqi
    Qi, Chun
    Wang, Pan
    Wang, Fengping
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
  • [25] Lightweight multi-scale network with attention for accurate and efficient crowd counting
    Xi, Mengyuan
    Yan, Hua
    VISUAL COMPUTER, 2024, 40 (06): : 4553 - 4566
  • [26] Cascade-guided multi-scale attention network for crowd counting
    Shufang Li
    Zhengping Hu
    Mengyao Zhao
    Zhe Sun
    Signal, Image and Video Processing, 2021, 15 : 1663 - 1670
  • [27] MGSNet: A multi-scale and gated spatial attention network for crowd counting
    Shi, Ying
    Sang, Jun
    Wu, Zhongyuan
    Wang, Fusen
    Liu, Xinyue
    Xia, Xiaofeng
    Sang, Nong
    APPLIED INTELLIGENCE, 2022, 52 (13) : 15436 - 15446
  • [28] Cascade-guided multi-scale attention network for crowd counting
    Li, Shufang
    Hu, Zhengping
    Zhao, Mengyao
    Sun, Zhe
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (08) : 1663 - 1670
  • [29] MGSNet: A multi-scale and gated spatial attention network for crowd counting
    Ying Shi
    Jun Sang
    Zhongyuan Wu
    Fusen Wang
    Xinyue Liu
    Xiaofeng Xia
    Nong Sang
    Applied Intelligence, 2022, 52 : 15436 - 15446
  • [30] Dense Scale Network for Crowd Counting
    Dai, Feng
    Liu, Hao
    Ma, Yike
    Zhang, Xi
    Zhao, Qiang
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 64 - 72