Multidimensional Measure Matching for Crowd Counting

被引:0
|
作者
Lin, Hui [1 ]
Hong, Xiaopeng [2 ,3 ]
Ma, Zhiheng [4 ,5 ,6 ]
Wang, Yaowei [3 ,7 ]
Meng, Deyu [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[2] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[4] Shenzhen Univ Adv Technol, Fac Computil Microelect, Shenzhen 518107, Peoples R China
[5] Chinese Acad Sci, Guangdong Prov Key Lab Computil Microelect, Shenzhen 518067, Peoples R China
[6] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518067, Peoples R China
[7] Harbin Inst Technol Shenzhen, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Estimation; Annotations; Transformers; Kernel; Density measurement; Computer vision; Training; Crowd counting; deep learning; multiscale; Sinkhorn divergence;
D O I
10.1109/TNNLS.2024.3435854
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article addresses the challenge of scale variations in crowd-counting problems from a multidimensional measure-theoretic perspective. We start by formulating crowd counting as a measure-matching problem, based on the assumption that discrete measures can express the scattered ground truth and the predicted density map. In this context, we introduce the Sinkhorn counting loss and extend it to the semi-balanced form, which alleviates the problems including entropic bias, distance destruction, and amount constraints. We then model the measure matching under the multidimensional space, in order to learn the counting from both location and scale. To achieve this, we extend the traditional 2-D coordinate support to 3-D, incorporating an additional axis to represent scale information, where a pyramid-based structure will be leveraged to learn the scale value for the predicted density. Extensive experiments on four challenging crowd-counting datasets, namely, ShanghaiTech A, UCF-QNRF, JHU ++, and NWPU have validated the proposed method. Code is released at https://github.com/LoraLinH/Multidimensional-Measure-Matching-for-Crowd-Counting.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Aggregated context network for crowd counting
    Yu, Si-yue
    Pu, Jian
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (11) : 1626 - 1638
  • [42] A viewpoint invariant approach for crowd counting
    Kong, Dan
    Gray, Doug
    Tao, Hai
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1187 - +
  • [43] ADAPTIVE SCENARIO DISCOVERY FOR CROWD COUNTING
    Wu, Xingjiao
    Zheng, Yingbin
    Ye, Hao
    Hu, Wenxin
    Yang, Jing
    He, Liang
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2382 - 2386
  • [44] DECCNet: Depth Enhanced Crowd Counting
    Yang, Shuo-Diao
    Su, Hung-Ting
    Hsu, Winston H.
    Chen, Wen-Chin
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4521 - 4530
  • [45] Dense Crowd Counting with Capsule Networks
    Roldao Reis, Victor Hugo
    Guimaraes, Silvio Jamil F.
    do Patrocinio Jr, Zenilton Kleber Goncalves
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 27TH EDITION, 2020, : 267 - 272
  • [46] Convolutional Attention Network for Crowd Counting
    Zhu, Yubin
    Li, Wengen
    Guan, Jihong
    Zhang, Yichao
    Computer Engineering and Applications, 2023, 59 (01) : 156 - 161
  • [47] Crowd Counting Using Accumulated HOG
    Xu, Tianchun
    Chen, Xiaohui
    Wei, Guo
    Wang, Weidong
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1877 - 1881
  • [48] Relational Attention Network for Crowd Counting
    Zhang, Anran
    Shen, Jiayi
    Xiao, Zehao
    Zhu, Fan
    Zhen, Xiantong
    Cao, Xianbin
    Shao, Ling
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6787 - 6796
  • [49] Relevant region prediction for crowd counting
    Chen, Xinya
    Bin, Yanrui
    Gao, Changxin
    Sang, Nong
    Tang, Hao
    NEUROCOMPUTING, 2020, 407 (407) : 399 - 408
  • [50] AECNet: Attentive EfficientNet For Crowd Counting
    Dosi, Muskan
    Thakral, Kartik
    Mittal, Surbhi
    Vatsa, Mayank
    Singh, Richa
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,