END-TO-END CROWD COUNTING VIA JOINT LEARNING LOCAL AND GLOBAL COUNT

Cited by: 0
Authors
Shang, Chong [1 ]
Ai, Haizhou [1 ]
Bai, Bo [2 ]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Tech, Tsinghua Natl Lab Info Sci & Tech, Beijing, Peoples R China
[2] Huawei Technol, Beijing, Peoples R China
Source
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2016
Keywords
Crowd counting; end-to-end; CNN;
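DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract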
Crowd counting is a very challenging task in crowded scenes due to heavy occlusions, appearance variations and perspective distortions. Current crowd counting methods typically operate on overlapping image patches, then sum over the patches to get the final count. In this paper, we propose an end-to-end convolutional neural network (CNN) architecture that takes a whole image as its input and directly outputs the counting result. While sharing computations over overlapping regions, our method takes advantage of contextual information when predicting both local and global counts. In particular, we first feed the image to a pre-trained CNN to get a set of high-level features. Then the features are mapped to local counting numbers using recurrent network layers with memory cells. We perform experiments on several challenging crowd counting datasets, achieving state-of-the-art results and demonstrating the effectiveness of our method.
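The relation between local and global counts mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the CNN and recurrent layers are omitted, the toy density map is made up, and the helper names `local_counts` and `joint_loss`, the non-overlapping grid, and the squared-error form with a `weight` parameter are all assumptions chosen for illustration, since the abstract does not state the loss.

```python
def local_counts(density, cell):
    """Sum a 2-D density map over non-overlapping cell x cell blocks."""
    rows, cols = len(density), len(density[0])
    return [
        [
            sum(density[r][c]
                for r in range(r0, min(r0 + cell, rows))
                for c in range(c0, min(c0 + cell, cols)))
            for c0 in range(0, cols, cell)
        ]
        for r0 in range(0, rows, cell)
    ]


def joint_loss(pred_local, true_local, weight=1.0):
    """Hypothetical joint objective: squared error on each local count
    plus a weighted squared error on the global (summed) count."""
    flat_pred = [v for row in pred_local for v in row]
    flat_true = [v for row in true_local for v in row]
    local_term = sum((p - t) ** 2 for p, t in zip(flat_pred, flat_true))
    global_term = (sum(flat_pred) - sum(flat_true)) ** 2
    return local_term + weight * global_term


# Toy 4x4 density map (made-up values): each entry is the fractional
# number of people attributed to that pixel.
density = [
    [0.0, 0.5, 0.0, 0.0],
    [0.5, 1.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0],
    [2.0, 0.0, 0.0, 1.0],
]

true_local = local_counts(density, cell=2)            # one count per 2x2 block
global_count = sum(v for row in true_local for v in row)

# A hypothetical prediction that overestimates one block by 0.5.
pred_local = [[2.0, 1.5], [2.0, 1.0]]
loss = joint_loss(pred_local, true_local)
```

Because the blocks do not overlap, the local counts sum exactly to the global count; with overlapping patches, as in the patch-based methods the abstract contrasts against, a plain sum would count shared pixels more than once.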
Pages: 1215 - 1219
Number of pages: 5
Related papers
50 in total
  • [21] End-to-end global to local convolutional neural network learning for hand pose recovery in depth data
    Madadi, Meysam
    Escalera, Sergio
    Baro, Xavier
    Gonzalez, Jordi
    IET COMPUTER VISION, 2022, 16 (01) : 50 - 66
  • [22] End-to-end Active Object Tracking via Reinforcement Learning
    Luo, Wenhan
    Sun, Peng
    Zhong, Fangwei
    Liu, Wei
    Zhang, Tong
    Wang, Yizhou
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [23] End-to-end multimodal image registration via reinforcement learning
    Hu, Jing
    Luo, Ziwei
    Wang, Xin
    Sun, Shanhui
    Yin, Youbing
    Cao, Kunlin
    Song, Qi
    Lyu, Siwei
    Wu, Xi
    MEDICAL IMAGE ANALYSIS, 2021, 68
  • [24] Spatial Signal Design for Positioning via End-to-End Learning
    Rivetti, Steven
    Miguel Mateos-Ramos, Jose
    Wu, Yibo
    Song, Jinxiang
    Keskin, Musa Furkan
    Yajnanarayana, Vijaya
    Hager, Christian
    Wymeersch, Henk
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (03) : 525 - 529
  • [25] Joint Bayesian guided metric learning for end-to-end face verification
    Chen, Di
    Xu, Chunyan
    Yang, Jian
    Qian, Jianjun
    Zheng, Yuhui
    Shen, Linlin
    NEUROCOMPUTING, 2018, 275 : 560 - 567
  • [26] END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER
    Yang, Xuesong
    Chen, Yun-Nung
    Hakkani-Tur, Dilek
    Crook, Paul
    Li, Xiujun
    Gao, Jianfeng
    Deng, Li
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5690 - 5694
  • [27] Arbitrary perspective crowd counting via local to global algorithm
    Hu, Chuanrui
    Cheng, Kai
    Xie, Yixiang
    Li, Teng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15059 - 15071
  • [28] Arbitrary perspective crowd counting via local to global algorithm
    Chuanrui Hu
    Kai Cheng
    Yixiang Xie
    Teng Li
    Multimedia Tools and Applications, 2020, 79 : 15059 - 15071
  • [29] End-to-End Incremental Learning
    Castro, Francisco M.
    Marin-Jimenez, Manuel J.
    Guil, Nicolas
    Schmid, Cordelia
    Alahari, Karteek
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 241 - 257
  • [30] A Compact End-to-End Model with Local and Global Context for Spoken Language Identification
    Jia, Fei
    Koluguri, Nithin Rao
    Balam, Jagadeesh
    Ginsburg, Boris
    INTERSPEECH 2023, 2023, : 5321 - 5325