STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting

被引:25
|
作者
Wang, Mingjie [1 ]
Cai, Hao [2 ]
Han, Xian-Feng [3 ]
Zhou, Jun [4 ]
Gong, Minglun [1 ]
机构
[1] Univ Guelph, Sch Comp Sci, Guelph, ON N1G 2W1, Canada
[2] Mem Univ Newfoundland, Dept Comp Sci, St John, NF A1B 3V6, Canada
[3] Southwest Univ, Chongqing 400715, Peoples R China
[4] Dalian Maritime Univ, Dalian 116026, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Tree structure; scale enhancer; multi-level auxiliator; crowd counting; SEGMENTATION; PEOPLE;
D O I
10.1109/TMM.2022.3142398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
State-of-the-art approaches for crowd counting resort to deepneural networks to predict density maps. However, counting people in congested scenes remains a challenging task because the presence of drastic scale variation, density inconsistency, and complex background can seriously degrade their counting accuracy. To battle the ingrained issue of accuracy degradation, in this paper, we propose a novel and powerful network called Scale Tree Network (STNet) for accurate crowd counting. STNet consists of two key components: a Scale-Tree Diversity Enhancer and a Multi-level Auxiliator. Specifically, the Diversity Enhancer is designed to enrich scale diversity, which alleviates limitations of existing methods caused by insufficient level of scales. A novel tree structure is adopted to hierarchically parse coarse-to-fine crowd regions. Furthermore, a simple yet effective Multi-level Auxiliator is presented to aid in exploiting generalisable shared characteristics at multiple levels, allowing more accurate pixel-wise background cognition. The overall STNet is trained in an end-to-end manner, without the needs for manually tuning loss weights between the main and the auxiliary tasks. Extensive experiments on five challenging crowd datasets demonstrate the superiority of the proposed method.
引用
收藏
页码:2074 / 2084
页数:11
相关论文
共 50 条
  • [41] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Donghua Liu
    Guodong Wang
    Guangtao Zhai
    Multimedia Tools and Applications, 2022, 81 : 37939 - 37952
  • [42] Cascade-guided multi-scale attention network for crowd counting
    Shufang Li
    Zhengping Hu
    Mengyao Zhao
    Zhe Sun
    Signal, Image and Video Processing, 2021, 15 : 1663 - 1670
  • [43] An Enhanced Scale Robust Network for Crowd Counting
    Liu, Caihua
    Duan, Yinong
    Du, Jiahao
    Xu, Tao
    IEEE ACCESS, 2020, 8 : 48352 - 48360
  • [44] End to End Multi-Scale Convolutional Neural Network for Crowd Counting
    Ji, Deyi
    Lu, Hongtao
    Zhang, Tongzhen
    ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041
  • [45] Crowd Counting via Residual Multi-scale Convolutional Neural Network
    Lu, Jingang
    Zhang, Li
    2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 315 - 320
  • [46] MGSNet: A multi-scale and gated spatial attention network for crowd counting
    Shi, Ying
    Sang, Jun
    Wu, Zhongyuan
    Wang, Fusen
    Liu, Xinyue
    Xia, Xiaofeng
    Sang, Nong
    APPLIED INTELLIGENCE, 2022, 52 (13) : 15436 - 15446
  • [47] Cascade-guided multi-scale attention network for crowd counting
    Li, Shufang
    Hu, Zhengping
    Zhao, Mengyao
    Sun, Zhe
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (08) : 1663 - 1670
  • [48] MGSNet: A multi-scale and gated spatial attention network for crowd counting
    Ying Shi
    Jun Sang
    Zhongyuan Wu
    Fusen Wang
    Xinyue Liu
    Xiaofeng Xia
    Nong Sang
    Applied Intelligence, 2022, 52 : 15436 - 15446
  • [49] An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting
    Zhang, Peng
    Lei, Weimin
    Zhao, Xinlei
    Dong, Lijia
    Lin, Zhaonan
    SENSORS, 2023, 23 (18)
  • [50] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Liu, Donghua
    Wang, Guodong
    Zhai, Guangtao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37939 - 37952