END-TO-END CROWD COUNTING VIA JOINT LEARNING LOCAL AND GLOBAL COUNT

被引：0

作者：

Shang, Chong ^{[1
]}

Ai, Haizhou ^{[1
]}

Bai, Bo ^{[2
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Tech, Tsinghua Natl Lab Info Sci & Tech, Beijing, Peoples R China

[2] Huawei Technol, Beijing, Peoples R China

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2016年

关键词：

Crowd counting; end-to-end; CNN;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Crowd counting is a very challenging task in crowded scenes due to heavy occlusions, appearance variations and perspective distortions. Current crowd counting methods typically operate on an image patch level with overlaps, then sum over the patches to get the final count. In this paper, we propose an end-to-end convolutional neural network (CNN) architecture that takes a whole image as its input and directly outputs the counting result. While making use of sharing computations over overlapping regions, our method takes advantages of contextual information when predicting both local and global count. In particular, we first feed the image to a pre-trained CNN to get a set of high level features. Then the features are mapped to local counting numbers using recurrent network layers with memory cells. We perform the experiments on several challenging crowd counting datasets, which achieve the state-of-the-art results and demonstrate the effectiveness of our method.

引用

页码：1215 / 1219

页数：5

共 50 条

[21] End-to-end global to local convolutional neural network learning for hand pose recovery in depth data
Madadi, Meysam
Escalera, Sergio
Baro, Xavier
Gonzalez, Jordi
IET COMPUTER VISION, 2022, 16 (01) : 50 - 66
[22] End-to-end Active Object Tracking via Reinforcement Learning
Luo, Wenhan
Sun, Peng
Zhong, Fangwei
Liu, Wei
Zhang, Tong
Wang, Yizhou
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[23] End-to-end multimodal image registration via reinforcement learning
Hu, Jing
Luo, Ziwei
Wang, Xin
Sun, Shanhui
Yin, Youbing
Cao, Kunlin
Song, Qi
Lyu, Siwei
Wu, Xi
MEDICAL IMAGE ANALYSIS, 2021, 68
[24] Spatial Signal Design for Positioning via End-to-End Learning
Rivetti, Steven
Miguel Mateos-Ramos, Jose
Wu, Yibo
Song, Jinxiang
Keskin, Musa Furkan
Yajnanarayana, Vijaya
Hager, Christian
Wymeersch, Henk
IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (03) : 525 - 529
[25] Joint Bayesian guided metric learning for end-to-end face verification
Chen, Di
Xu, Chunyan
Yang, Jian
Qian, Jianjun
Zheng, Yuhui
Shen, Linlin
NEUROCOMPUTING, 2018, 275 : 560 - 567
[26] END-TO-END JOINT LEARNING OF NATURAL LANGUAGE UNDERSTANDING AND DIALOGUE MANAGER
Yang, Xuesong
Chen, Yun-Nung
Hakkani-Tur, Dilek
Crook, Paul
Li, Xiujun
Gao, Jianfeng
Deng, Li
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5690 - 5694
[27] Arbitrary perspective crowd counting via local to global algorithm
Hu, Chuanrui
Cheng, Kai
Xie, Yixiang
Li, Teng
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15059 - 15071
[28] Arbitrary perspective crowd counting via local to global algorithm
Chuanrui Hu
Kai Cheng
Yixiang Xie
Teng Li
Multimedia Tools and Applications, 2020, 79 : 15059 - 15071
[29] End-to-End Incremental Learning
Castro, Francisco M.
Marin-Jimenez, Manuel J.
Guil, Nicolas
Schmid, Cordelia
Alahari, Karteek
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 241 - 257
[30] A Compact End-to-End Model with Local and Global Context for Spoken Language Identification
Jia, Fei
Koluguri, Nithin Rao
Balam, Jagadeesh
Ginsburg, Boris
INTERSPEECH 2023, 2023, : 5321 - 5325

← 1 2 3 4 5 →