Multidimensional Measure Matching for Crowd Counting

被引：0

作者：

Lin, Hui ^{[1
]}

Hong, Xiaopeng ^{[2
,3
]}

Ma, Zhiheng ^{[4
,5
,6
]}

Wang, Yaowei ^{[3
,7
]}

Meng, Deyu ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China

[2] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China

[3] Peng Cheng Lab, Shenzhen 518066, Peoples R China

[4] Shenzhen Univ Adv Technol, Fac Computil Microelect, Shenzhen 518107, Peoples R China

[5] Chinese Acad Sci, Guangdong Prov Key Lab Computil Microelect, Shenzhen 518067, Peoples R China

[6] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518067, Peoples R China

[7] Harbin Inst Technol Shenzhen, Shenzhen 518055, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年

基金：

中国国家自然科学基金;

关键词：

Estimation; Annotations; Transformers; Kernel; Density measurement; Computer vision; Training; Crowd counting; deep learning; multiscale; Sinkhorn divergence;

D O I：

10.1109/TNNLS.2024.3435854

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article addresses the challenge of scale variations in crowd-counting problems from a multidimensional measure-theoretic perspective. We start by formulating crowd counting as a measure-matching problem, based on the assumption that discrete measures can express the scattered ground truth and the predicted density map. In this context, we introduce the Sinkhorn counting loss and extend it to the semi-balanced form, which alleviates the problems including entropic bias, distance destruction, and amount constraints. We then model the measure matching under the multidimensional space, in order to learn the counting from both location and scale. To achieve this, we extend the traditional 2-D coordinate support to 3-D, incorporating an additional axis to represent scale information, where a pyramid-based structure will be leveraged to learn the scale value for the predicted density. Extensive experiments on four challenging crowd-counting datasets, namely, ShanghaiTech A, UCF-QNRF, JHU ++, and NWPU have validated the proposed method. Code is released at https://github.com/LoraLinH/Multidimensional-Measure-Matching-for-Crowd-Counting.

引用

页数：15

共 50 条

[41] Aggregated context network for crowd counting
Yu, Si-yue
Pu, Jian
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (11) : 1626 - 1638
[42] A viewpoint invariant approach for crowd counting
Kong, Dan
Gray, Doug
Tao, Hai
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1187 - +
[43] ADAPTIVE SCENARIO DISCOVERY FOR CROWD COUNTING
Wu, Xingjiao
Zheng, Yingbin
Ye, Hao
Hu, Wenxin
Yang, Jing
He, Liang
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2382 - 2386
[44] DECCNet: Depth Enhanced Crowd Counting
Yang, Shuo-Diao
Su, Hung-Ting
Hsu, Winston H.
Chen, Wen-Chin
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4521 - 4530
[45] Dense Crowd Counting with Capsule Networks
Roldao Reis, Victor Hugo
Guimaraes, Silvio Jamil F.
do Patrocinio Jr, Zenilton Kleber Goncalves
PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 27TH EDITION, 2020, : 267 - 272
[46] Convolutional Attention Network for Crowd Counting
Zhu, Yubin
Li, Wengen
Guan, Jihong
Zhang, Yichao
Computer Engineering and Applications, 2023, 59 (01) : 156 - 161
[47] Crowd Counting Using Accumulated HOG
Xu, Tianchun
Chen, Xiaohui
Wei, Guo
Wang, Weidong
2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1877 - 1881
[48] Relational Attention Network for Crowd Counting
Zhang, Anran
Shen, Jiayi
Xiao, Zehao
Zhu, Fan
Zhen, Xiantong
Cao, Xianbin
Shao, Ling
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6787 - 6796
[49] Relevant region prediction for crowd counting
Chen, Xinya
Bin, Yanrui
Gao, Changxin
Sang, Nong
Tang, Hao
NEUROCOMPUTING, 2020, 407 (407) : 399 - 408
[50] AECNet: Attentive EfficientNet For Crowd Counting
Dosi, Muskan
Thakral, Kartik
Mittal, Surbhi
Vatsa, Mayank
Singh, Richa
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,

← 1 2 3 4 5 →