An Adaptive Multi-Scale Network Based on Depth Information for Crowd Counting

被引:0
|
作者
Zhang, Peng [1 ]
Lei, Weimin [1 ]
Zhao, Xinlei [2 ]
Dong, Lijia [2 ]
Lin, Zhaonan [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110167, Peoples R China
[2] 213 Elect Technol Co Ltd, Artificial Intelligence Res Inst Shenyang, Shenyang 110023, Peoples R China
关键词
object counting; crowd counting; deep learning; CNN; SEGMENTATION; PEOPLE;
D O I
10.3390/s23187805
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Crowd counting, as a basic computer vision task, plays an important role in many fields such as video surveillance, accident prediction, public security, and intelligent transportation. At present, crowd counting tasks face various challenges. Firstly, due to the diversity of crowd distribution and increasing population density, there is a phenomenon of large-scale crowd aggregation in public places, sports stadiums, and stations, resulting in very serious occlusion. Secondly, when annotating large-scale datasets, positioning errors can also easily affect training results. In addition, the size of human head targets in dense images is not consistent, making it difficult to identify both near and far targets using only one network simultaneously. The existing crowd counting methods mainly use density plot regression methods. However, this framework does not distinguish the features between distant and near targets and cannot adaptively respond to scale changes. Therefore, the detection performance in areas with sparse population distribution is not good. To solve such problems, we propose an adaptive multi-scale far and near distance network based on the convolutional neural network (CNN) framework for counting dense populations and achieving a good balance between accuracy, inference speed, and performance. However, on the feature level, in order to enable the model to distinguish the differences between near and far features, we use stacked convolution layers to deepen the depth of the network, allocate different receptive fields according to the distance between the target and the camera, and fuse the features between nearby targets to enhance the feature extraction ability of pedestrians under nearby targets. Secondly, depth information is used to distinguish distant and near targets of different scales and the original image is cut into four different patches to perform pixel-level adaptive modeling on the population. In addition, we add density normalized average precision (nAP) indicators to analyze the accuracy of our method in spatial positioning. This paper validates the effectiveness of NF-Net on three challenging benchmarks in Shanghai Tech Part A and B, UCF_ CC_50, and UCF-QNRF datasets. Compared with SOTA, it has more significant performance in various scenarios. In the UCF-QNRF dataset, it is further validated that our method effectively solves the interference of complex backgrounds.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] MSNet: Multi-scale Network for Crowd Counting
    Shi, Ying
    Sang, Jun
    Alam, Mohammad S.
    Liu, Xinyue
    Tian, Shaoli
    PATTERN RECOGNITION AND TRACKING XXXII, 2021, 11735
  • [2] Multi-scale supervised network for crowd counting
    Wang, Yongjie
    Zhang, Wei
    Huang, Dongxiao
    Liu, Yanyan
    Zhu, Jianghua
    IET IMAGE PROCESSING, 2020, 14 (17) : 4701 - 4707
  • [3] Crowd Counting Method Based on Multi-Scale Enhanced Network
    Xu Tao
    Duan Yinong
    Du Jiahao
    Liu Caihua
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1764 - 1771
  • [4] Dense Crowd Counting Network Based on Multi-scale Perception
    Li, Hengchao
    Liu, Xianglian
    Liu, Peng
    Feng, Bin
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2024, 59 (05): : 1176 - 1183
  • [5] People Counting Based on Multi-scale Region Adaptive Segmentation and Depth Neural Network
    Min, Feng
    Wang, Yansong
    Zhu, Sicheng
    AIPR 2020: 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2020, : 79 - 83
  • [6] Redesigning Multi-Scale Neural Network for Crowd Counting
    Du, Zhipeng
    Shi, Miaojing
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3664 - 3678
  • [7] Multi-Scale Guided Attention Network for Crowd Counting
    Li, Pengfei
    Zhang, Min
    Wan, Jian
    Jiang, Ming
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [8] Multi-scale Attention Recalibration Network for crowd counting
    Xie, Jinyang
    Pang, Chen
    Zheng, Yanjun
    Li, Liang
    Lyu, Chen
    Lyu, Lei
    Liu, Hong
    APPLIED SOFT COMPUTING, 2022, 117
  • [9] STOCHASTIC MULTI-SCALE AGGREGATION NETWORK FOR CROWD COUNTING
    Wang, Mingjie
    Cai, Hao
    Zhou, Jun
    Gong, Minglun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2008 - 2012
  • [10] LigMSANet: Lightweight multi-scale adaptive convolutional neural network for dense crowd counting
    Jiang, Guoquan
    Wu, Rui
    Huo, Zhanqiang
    Zhao, Cuijun
    Luo, Junwei
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197