STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting

被引:25
|
作者
Wang, Mingjie [1 ]
Cai, Hao [2 ]
Han, Xian-Feng [3 ]
Zhou, Jun [4 ]
Gong, Minglun [1 ]
机构
[1] Univ Guelph, Sch Comp Sci, Guelph, ON N1G 2W1, Canada
[2] Mem Univ Newfoundland, Dept Comp Sci, St John, NF A1B 3V6, Canada
[3] Southwest Univ, Chongqing 400715, Peoples R China
[4] Dalian Maritime Univ, Dalian 116026, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Tree structure; scale enhancer; multi-level auxiliator; crowd counting; SEGMENTATION; PEOPLE;
D O I
10.1109/TMM.2022.3142398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
State-of-the-art approaches for crowd counting resort to deepneural networks to predict density maps. However, counting people in congested scenes remains a challenging task because the presence of drastic scale variation, density inconsistency, and complex background can seriously degrade their counting accuracy. To battle the ingrained issue of accuracy degradation, in this paper, we propose a novel and powerful network called Scale Tree Network (STNet) for accurate crowd counting. STNet consists of two key components: a Scale-Tree Diversity Enhancer and a Multi-level Auxiliator. Specifically, the Diversity Enhancer is designed to enrich scale diversity, which alleviates limitations of existing methods caused by insufficient level of scales. A novel tree structure is adopted to hierarchically parse coarse-to-fine crowd regions. Furthermore, a simple yet effective Multi-level Auxiliator is presented to aid in exploiting generalisable shared characteristics at multiple levels, allowing more accurate pixel-wise background cognition. The overall STNet is trained in an end-to-end manner, without the needs for manually tuning loss weights between the main and the auxiliary tasks. Extensive experiments on five challenging crowd datasets demonstrate the superiority of the proposed method.
引用
收藏
页码:2074 / 2084
页数:11
相关论文
共 50 条
  • [21] Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting
    Fang Y.
    Gao S.
    Li J.
    Luo W.
    He L.
    Hu B.
    Neurocomputing, 2020, 392 : 98 - 107
  • [22] MULTI-STEP QUANTIZATION OF A MULTI-SCALE NETWORK FOR CROWD COUNTING
    Shim, Kyujin
    Byun, Junyoung
    Kim, Changick
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 683 - 687
  • [23] Dense Scale Network for Crowd Counting
    Dai, Feng
    Liu, Hao
    Ma, Yike
    Zhang, Xi
    Zhao, Qiang
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 64 - 72
  • [24] Scale Pyramid Network for Crowd Counting
    Chen, Xinya
    Bin, Yanrui
    Sang, Nong
    Gao, Changxin
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1941 - 1950
  • [25] A scale adaptive network for crowd counting
    Zhang, Youmei
    Zhou, Chunluan
    Chang, Faliang
    Kot, Alex C.
    NEUROCOMPUTING, 2019, 362 : 139 - 146
  • [26] Crowd Counting Method Based on Multi-Scale Enhanced Network
    Xu Tao
    Duan Yinong
    Du Jiahao
    Liu Caihua
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1764 - 1771
  • [27] A multi-scale fusion and dual attention network for crowd counting
    De Zhang
    Yiting Wang
    Xiaoping Zhou
    Liangliang Su
    Multimedia Tools and Applications, 2025, 84 (13) : 11269 - 11294
  • [28] Dense Crowd Counting Network Based on Multi-scale Perception
    Li, Hengchao
    Liu, Xianglian
    Liu, Peng
    Feng, Bin
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2024, 59 (05): : 1176 - 1183
  • [29] MHANet: Multi-scale hybrid attention network for crowd counting
    Yu, Ying
    Yu, Jiamao
    Qian, Jin
    Zhu, Zhiliang
    Han, Xing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9445 - 9455
  • [30] Double multi-scale feature fusion network for crowd counting
    Liu, Qian
    Fang, Jiongtao
    Zhong, Yixiong
    Wang, Cunbao
    Qi, Youwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (34) : 81831 - 81855