An Effective Lightweight Crowd Counting Method Based on an Encoder-Decoder Network for Internet of Video Things

Cited by: 7
Authors
Yi, Jun [1]
Chen, Fan [1]
Shen, Zhilong [2]
Xiang, Yi [1]
Xiao, Shan [3]
Zhou, Wei [1]
Affiliations
[1] Chongqing Univ Sci & Technol, Coll Intelligent Technol & Engn, Chongqing 401331, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing 400065, Peoples R China
[3] Chongqing Coll Elect Engn, Inst Big Data & Optimizat, Chongqing 401331, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Convolution neural network; crowd counting; edge computing; lightweight network
DOI
10.1109/JIOT.2023.3294727
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
An emerging Internet of Video Things (IoVT) application, crowd counting is a computer vision task where the number of heads in a crowded scene is estimated. In recent years, it has attracted increasing attention from academia and industry because of its great potential value in public safety and urban planning. However, it has become a challenge to cross the gap between the increasingly heavy and complex network architecture widely used for the pursuit of counting with high accuracy and the constrained computing and storage resources in the edge computing environment. To address this issue, an effective lightweight crowd counting method based on an encoder-decoder network, named lightweight crowd counting network (LEDCrowdNet), is proposed to achieve an optimal tradeoff between counting performance and running speed for edge applications of IoVT. In particular, an improved MobileViT module as an encoder is designed to extract global-local crowd features of various scales. The decoder is composed of the adaptive multiscale large kernel attention module (AMLKA) and the lightweight counting atrous spatial pyramid pooling process module (LC-ASPP), which can perform end-to-end training to obtain the final density map. The proposed LEDCrowdNet is suitable for deployment on two edge computing platforms (NVIDIA Jetson Xavier NX and Coral Edge TPU) to reduce the number of floating point operations (FLOPs) without a significant drop in accuracy. Extensive experiments on five mainstream benchmarks (ShanghaiTech Part_A/B, UCF_CC_50, UCF-QNRF, WorldExpo'10, and RSOC data sets) verify the correctness and efficiency of our method.
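The abstract states that the network outputs a density map from which the number of heads is estimated. In density-map-based crowd counting, the count is conventionally read off as the sum of all values in the map. The snippet below is a minimal sketch of that final step only, not the authors' implementation; the function name `count_from_density` and the toy values are hypothetical and chosen purely for illustration.

```python
def count_from_density(density_map):
    """Return the estimated head count for a 2-D density map.

    density_map is assumed to be a list of rows of non-negative floats,
    where each value is the predicted crowd density at that pixel.
    The count is the sum over all pixels.
    """
    return sum(sum(row) for row in density_map)


# Hypothetical 2x3 density map, illustrative values only.
toy_map = [
    [0.0, 0.5, 0.5],
    [1.0, 0.0, 0.0],
]

estimated_count = count_from_density(toy_map)
print(estimated_count)
```

In practice the map would come from the decoder as a tensor, and the same reduction would be a single sum over its spatial dimensions.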
Pages: 3082-3094
Page count: 13