Learning Pixel-Level and Instance-Level Context-Aware Features for Pedestrian Detection in Crowds

被引:11
|
作者
Fei, Chi [1 ]
Liu, Bin [1 ]
Chen, Zhu [1 ]
Yu, Nenghai [1 ]
机构
[1] Univ Sci & Technol China, Chinese Acad Sci, Sch Informat Sci & Technol, Key Lab Electromagnet Space Informat, Hefei 230026, Anhui, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
Pedestrian detection; context; pixel-level; instance-level;
D O I
10.1109/ACCESS.2019.2928879
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian detection in crowded scenes is an intractable problem in computer vision, in which occlusion often presents a great challenge. In this paper, we propose a novel context-aware feature learning method for detecting pedestrians in crowds, with the purpose of making better use of context information for dealing with occlusion. Unlike most current pedestrian detectors that only extract context information from a single and fixed region, a new pixel-level context embedding module is developed to integrate multi-cue context into a deep CNN feature hierarchy, which enables access to the context of various regions by multi-branch convolution layers with different receptive fields. In addition, to utilize the distinctive visual characteristics formed by pedestrians that appear in groups and occlude each other, we propose a novel instance-level context prediction module which is actually implemented by a two-person detector, to improve the one-person detection performance. Applying with these strategies, we achieve an efficient and lightweight detector that can be trained in an end-to-end fashion. We evaluate the proposed approach on two popular pedestrian detection datasets, i.e., Caltech and CityPersons. The extensive experimental results demonstrate the effectiveness of the proposed method, especially under heavy occlusion cases.
引用
收藏
页码:94944 / 94953
页数:10
相关论文
共 50 条
  • [1] Pixel-Level Encoding and Depth Layering for Instance-Level Semantic Labeling
    Uhrig, Jonas
    Cordts, Marius
    Franke, Uwe
    Brox, Thomas
    PATTERN RECOGNITION, GCPR 2016, 2016, 9796 : 14 - 25
  • [2] Instance-level Context Attention Network for instance segmentation
    Shang, Chao
    Li, Hongliang
    Meng, Fanman
    Qiu, Heqian
    Wu, Qingbo
    Xu, Linfeng
    Ngan, King Ngi
    NEUROCOMPUTING, 2022, 472 : 124 - 137
  • [3] Unveiling image source: Instance-level camera device linking via context-aware deep Siamese network
    Zheng, Mingjie
    Law, Ngai Fong
    Siu, Wan-Chi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
  • [4] Instance-Level Contrastive Learning for Weakly Supervised Object Detection
    Zhang, Ming
    Zeng, Bing
    SENSORS, 2022, 22 (19)
  • [5] Learning Pixel-Level Distinctions for Video Highlight Detection
    Wei, Fanyue
    Wang, Biao
    Ge, Tiezheng
    Jiang, Yuning
    Li, Wen
    Duan, Lixin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3063 - 3072
  • [6] APLCNet: Automatic Pixel-Level Crack Detection Network Based on Instance Segmentation
    Zhang, Yuefei
    Chen, Bin
    Wang, Jinfei
    Li, Jianming
    Sun, Xiaofei
    IEEE ACCESS, 2020, 8 : 199159 - 199170
  • [7] Pixel-Level Hand Detection with Shape-Aware Structured Forests
    Zhu, Xiaolong
    Jia, Xuhui
    Wong, Kwan-Yee K.
    COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 64 - 78
  • [8] EXPLORING INSTANCE-LEVEL UNCERTAINTY FOR MEDICAL DETECTION
    Yang, Jiawei
    Liang, Yuan
    Zhang, Yao
    Song, Weinan
    Wang, Kun
    He, Lei
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 448 - 452
  • [9] Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection
    Fu, Lei
    Gu, Wen-bin
    Ai, Yong-bao
    Li, Wei
    Wang, Dong
    INFRARED PHYSICS & TECHNOLOGY, 2021, 116
  • [10] Adaptive spatial pixel-level feature fusion network for multispectral pedestrian detection
    Fu, Lei
    Gu, Wen-bin
    Ai, Yong-bao
    Li, Wei
    Wang, Dong
    Infrared Physics and Technology, 2021, 116