Self-attention Guidance Based Crowd Localization and Counting

被引：1

作者：

Ma, Zhouzhou ^{[1
,2
]}

Gu, Guanghua ^{[1
,2
]}

Zhao, Wenrui ^{[1
,2
]}

机构：

[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066000, Peoples R China

[2] Hebei Key Lab Informat Transmiss & Signal Proc, Qinhuangdao 066000, Peoples R China

来源：

MACHINE INTELLIGENCE RESEARCH | 2024年 / 21卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Crowd localization; crowd counting; transformer; point supervision; object detection; IMAGE; NETWORK;

D O I：

10.1007/s11633-023-1428-6

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Most existing studies on crowd analysis are limited to the level of counting, which cannot provide the exact location of individuals. This paper proposes a self-attention guidance based crowd localization and counting network (SA-CLCN), which can simultaneously locate and count crowds. We take the form of object detection, using the original point annotations of crowd datasets as supervision to train the network. Ultimately, the center point coordinate of each head as well as the number of crowds are predicted. Specifically, to cope with the spatial and positional variations of the crowd, the proposed method introduces transformer to construct a globallocal feature extractor (GLFE) together with the convolutional structure. It establishes the near-to-far dependency between elements so that the global context and local detail features of the crowd image can be extracted simultaneously. Then, this paper designs a pyramid feature fusion module (PFFM) to fuse the global and local information from high level to low level to obtain a multiscale feature representation. In downstream tasks, this paper predicts candidate point offsets and confidence scores by a simple regression header and classification header. In addition, the Hungarian algorithm is used to match the predicted point set and the labelled point set to facilitate the calculation of losses. The proposed network avoids the errors or higher costs associated with using traditional density maps or bounding box annotations. Importantly, we have conducted extensive experiments on several crowd datasets, and the proposed method has produced competitive results in both counting and localization.

引用

页码：966 / 982

页数：17

共 50 条

[21] Application of improved transformer based on weakly supervised in crowd localization and crowd counting
Hui Gao
Wenjun Zhao
Dexian Zhang
Miaolei Deng
Scientific Reports, 13
[22] Application of improved transformer based on weakly supervised in crowd localization and crowd counting
Gao, Hui
Zhao, Wenjun
Zhang, Dexian
Deng, Miaolei
SCIENTIFIC REPORTS, 2023, 13 (01)
[23] Crowd counting with crowd attention convolutional neural network
Chen, Jiwei
Su, Wen
Wang, Zengfu
NEUROCOMPUTING, 2020, 382 : 210 - 220
[24] Application of solely self-attention mechanism in CSI-fingerprinting-based indoor localization
Kabo Poloko Nkabiti
Yueyun Chen
Neural Computing and Applications, 2021, 33 : 9185 - 9198
[25] Application of solely self-attention mechanism in CSI-fingerprinting-based indoor localization
Nkabiti, Kabo Poloko
Chen, Yueyun
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (15): : 9185 - 9198
[26] Self-Attention Guidance and Multiscale Feature Fusion-Based UAV Image Object Detection
Zhang, Yunzuo
Wu, Cunyu
Zhang, Tian
Liu, Yameng
Zheng, Yuxin
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[27] EEG-Based Emotion Recognition With Emotion Localization via Hierarchical Self-Attention
Zhang, Yuzhe
Liu, Huan
Zhang, Dalin
Chen, Xuxu
Qin, Tao
Zheng, Qinghua
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2458 - 2469
[28] Spatial-Temporal Action Localization With Hierarchical Self-Attention
Pramono, Rizard Renanda Adhi
Chen, Yie-Tarng
Fang, Wen-Hsien
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 625 - 639
[29] Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Hong, Susung
Lee, Gyuseong
Jang, Wooseok
Kim, Seungryong
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 7428 - 7437
[30] Shallow Feature Based Dense Attention Network for Crowd Counting
Miao, Yunqi
Lin, Zijia
Ding, Guiguang
Han, Jungong
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11765 - 11772

← 1 2 3 4 5 →