Hybrid attention network based on progressive embedding scale-context for crowd counting

被引:21
|
作者
Wang, Fusen [1 ,2 ]
Sang, Jun [1 ,2 ]
Wu, Zhongyuan [1 ,2 ]
Liu, Qi [1 ,2 ]
Sang, Nong [3 ]
机构
[1] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430000, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowd counting; Hybrid attention; Progressive embedding scale-context; Density map estimation;
D O I
10.1016/j.ins.2022.01.046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing crowd counting methods usually adopt attention mechanisms to tackle background noise, or apply multilevel features or multiscale context fusion to tackle scale variation. However, these approaches deal with these two problems separately. In this paper, we propose a hybrid attention network (HAN) by employing progressive embedding scale context (PES) information, which enables the network to simultaneously suppress noise and adapt head scale variation. We build the hybrid attention mechanism through two parallel spatial attention and channel attention modules, which makes the network focus more on the human head area and reduce the interference of background objects. In addition, we embed certain scale-context to the hybrid attention along the spatial and channel dimensions to alleviate the counting errors caused by the variation of perspective and head scale. Finally, we propose a progressive learning strategy through cascading multiple hybrid attention modules with embedding different scale contexts, which can gradually integrate different scale-context information into the current feature map from global to local. Ablation experiments show that the network architecture can gradually learn multi scale features and suppress background noise. Extensive experiments demonstrate that HANet obtains state-of-the-art counting performance on five mainstream datasets.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:306 / 318
页数:13
相关论文
共 50 条
  • [31] Scale Pyramid Network for Crowd Counting
    Chen, Xinya
    Bin, Yanrui
    Sang, Nong
    Gao, Changxin
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1941 - 1950
  • [32] A scale adaptive network for crowd counting
    Zhang, Youmei
    Zhou, Chunluan
    Chang, Faliang
    Kot, Alex C.
    NEUROCOMPUTING, 2019, 362 : 139 - 146
  • [33] Progressive Crowd Enhancement De-Background Network for crowd counting
    Wang, Lin
    Li, Jie
    Qi, Chun
    Wang, Fengping
    Wang, Pan
    VISUAL COMPUTER, 2024, : 3695 - 3717
  • [34] Crowd counting in complex scenes based on an attention aware CNN network
    Li, Zhaoxin
    Lu, Shuhua
    Lan, Lingqiang
    Liu, Qiyuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [35] Crowd counting method based on the self-attention residual network
    Liu, Yan-Bo
    Jia, Rui-Sheng
    Liu, Qing-Ming
    Zhang, Xing-Li
    Sun, Hong-Mei
    APPLIED INTELLIGENCE, 2021, 51 (01) : 427 - 440
  • [36] Adaptive Context Learning Network for Crowd Counting
    Liu, Zhao
    Zeng, Guanqi
    Feng, Zunlei
    Zhang, Rong
    Song, Mingli
    Shen, Jianping
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 4103 - 4109
  • [37] ACCNet: Attention-based Contextual Convolutional Network for Crowd Counting
    Huang, Yaoying
    Zhu, Aichun
    Duan, Guoxiu
    Hu, Fangqiang
    Li, Yifeng
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 1926 - 1931
  • [38] Crowd counting method based on the self-attention residual network
    Yan-Bo Liu
    Rui-Sheng Jia
    Qing-Ming Liu
    Xing-Li Zhang
    Hong-Mei Sun
    Applied Intelligence, 2021, 51 : 427 - 440
  • [39] MSFFA: a multi-scale feature fusion and attention mechanism network for crowd counting
    Li, Zhaoxin
    Lu, Shuhua
    Dong, Yishan
    Guo, Jingyuan
    VISUAL COMPUTER, 2023, 39 (03): : 1045 - 1056
  • [40] Crowd counting using a self-attention multi-scale cascaded network
    Li, He
    Zhang, Shihui
    Kong, Weihang
    IET COMPUTER VISION, 2019, 13 (06) : 556 - 561