Self-attention Guidance Based Crowd Localization and Counting

被引:1
|
作者
Ma, Zhouzhou [1 ,2 ]
Gu, Guanghua [1 ,2 ]
Zhao, Wenrui [1 ,2 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066000, Peoples R China
[2] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao 066000, Peoples R China
基金
中国国家自然科学基金;
关键词
Crowd localization; crowd counting; transformer; point supervision; object detection; IMAGE; NETWORK;
D O I
10.1007/s11633-023-1428-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most existing studies on crowd analysis are limited to the level of counting, which cannot provide the exact location of individuals. This paper proposes a self-attention guidance based crowd localization and counting network (SA-CLCN), which can simultaneously locate and count crowds. We take the form of object detection, using the original point annotations of crowd datasets as supervision to train the network. Ultimately, the center point coordinate of each head as well as the number of crowds are predicted. Specifically, to cope with the spatial and positional variations of the crowd, the proposed method introduces transformer to construct a globallocal feature extractor (GLFE) together with the convolutional structure. It establishes the near-to-far dependency between elements so that the global context and local detail features of the crowd image can be extracted simultaneously. Then, this paper designs a pyramid feature fusion module (PFFM) to fuse the global and local information from high level to low level to obtain a multiscale feature representation. In downstream tasks, this paper predicts candidate point offsets and confidence scores by a simple regression header and classification header. In addition, the Hungarian algorithm is used to match the predicted point set and the labelled point set to facilitate the calculation of losses. The proposed network avoids the errors or higher costs associated with using traditional density maps or bounding box annotations. Importantly, we have conducted extensive experiments on several crowd datasets, and the proposed method has produced competitive results in both counting and localization.
引用
收藏
页码:966 / 982
页数:17
相关论文
共 50 条
  • [31] Jointly attention network for crowd counting
    He, Yuqiang
    Xia, Yinfeng
    Wang, Yizhen
    Yin, Baoqun
    NEUROCOMPUTING, 2022, 487 : 157 - 171
  • [32] Motional foreground attention-based video crowd counting
    Ling, Miaogen
    Pan, Tianhang
    Ren, Yi
    Wang, Ke
    Geng, Xin
    PATTERN RECOGNITION, 2023, 144
  • [33] Convolutional Attention Network for Crowd Counting
    Zhu, Yubin
    Li, Wengen
    Guan, Jihong
    Zhang, Yichao
    Computer Engineering and Applications, 2023, 59 (01) : 156 - 161
  • [34] Relational Attention Network for Crowd Counting
    Zhang, Anran
    Shen, Jiayi
    Xiao, Zehao
    Zhu, Fan
    Zhen, Xiantong
    Cao, Xianbin
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6787 - 6796
  • [35] Attention to Head Locations for Crowd Counting
    Zhang, Youmei
    Zhou, Chunluan
    Chang, Faliang
    Kot, Alex C.
    Zhang, Wei
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 727 - 737
  • [36] Crowd Counting Guided by Attention Network
    Nie, Pei
    Fan, Cien
    Zou, Lian
    Chen, Liqiong
    Li, Xiaopeng
    INFORMATION, 2020, 11 (12) : 1 - 10
  • [37] Unsupervised Pansharpening Based on Self-Attention Mechanism
    Qu, Ying
    Baghbaderani, Razieh Kaviani
    Qi, Hairong
    Kwan, Chiman
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (04): : 3192 - 3208
  • [38] Keyphrase Generation Based on Self-Attention Mechanism
    Yang, Kehua
    Wang, Yaodong
    Zhang, Wei
    Yao, Jiqing
    Le, Yuquan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (02): : 569 - 581
  • [39] Self-Attention Based Network for Punctuation Restoration
    Wang, Feng
    Chen, Wei
    Yang, Zhen
    Xu, Bo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2803 - 2808
  • [40] Session-Based Recommendation with Self-Attention
    Anh, Pharr Hoang
    Bach, Ngo Xuan
    Phuong, Tu Minh
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 1 - 8