Joint Deep and Depth for Object-Level Segmentation and Stereo Tracking in Crowds

Cited by: 14

Authors:

Li, Jing [1]

Wei, Lisong [1]

Zhang, Fangbing [2]

Yang, Tao [2]

Lu, Zhaoyang [1]

Affiliations:

[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China

[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Shaanxi, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2019 / Vol. 21 / No. 10

Funding:

National Natural Science Foundation of China;

Keywords:

Multiple object tracking; object-level segmentation; stereo vision; severe occlusion; PARTICLE PHD FILTER; MULTIPLE;

DOI:

10.1109/TMM.2019.2908350

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Subject classification code:

0812 ;

Abstract:

Tracking multiple people in crowds is a fundamental and essential task in the multimedia field. It is often hindered by difficulties, such as dynamic occlusion between objects, cluttered background, and abrupt illumination changes. To respond to this need, in this paper, we combine deep and depth to build a stereo tracking system for crowds. The core of the system is the fusion of the advantages of deep learning and depth information, which is exploited to achieve object segmentation and improve the multiobject tracking performance in severe occlusion. More specifically, first, to obtain more accurate detection observations in the tracking system, we present a novel object-level segmentation method. This method combines the effective detection results of deep learning with depth information to obtain precise object segmentation results. Then, we integrate the segmentation results and three-dimensional (3-D) information to extract 2-D and 3-D characteristics to represent the target, and design three similarity models to realize a stereo tracking method through data association in crowds. Finally, we build a diverse stereo dataset including various challenging indoor and outdoor scenes. The comprehensive experiments verify the effective and robust tracking performance of our system in various scenarios, and the system has rich output results including segmentation results, target distance, and tracking results. Moreover, the qualitative and quantitative comparison results show that the proposed algorithm not only has good object segmentation performance but also improves the tracking performance of completely and partially occluded objects, which is superior to the tested state-of-the-art tracking approaches.


Pages: 2531 - 2544

Page count: 14

50 items in total

[1] Object Detection and Tracking Under Occlusion for Object-Level RGB-D Video Segmentation
Xie, Qian
Remil, Oussama
Guo, Yanwen
Wang, Meng
Wei, Mingqiang
Wang, Jun
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (03) : 580 - 592
[2] Object Stereo-Joint Stereo Matching and Object Segmentation
Bleyer, Michael
Rother, Carsten
Kohli, Pushmeet
Scharstein, Daniel
Sinha, Sudipta
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
[3] Object-Level Image Segmentation Using Low Level Cues
Zhu, Hongyuan
Zheng, Jianmin
Cai, Jianfei
Thalmann, Nadia M.
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (10) : 4019 - 4027
[4] Joint Object Segmentation and Depth Upsampling
Huang, Wenqi
Gong, Xiaojin
Yang, Michael Ying
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (02) : 192 - 196
[5] Multiclass Semantic Video Segmentation with Object-level Active Inference
Liu, Buyu
He, Xuming
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4286 - 4294
[6] Object-Level Segmentation of Indoor Point Clouds by the Convexity of Adjacent Object Regions
Luo, Nan
Wang, Quan
Wei, Qi
Jing, Chuan
IEEE ACCESS, 2019, 7 : 171934 - 171949
[7] Sparse Object-level Supervision for Instance Segmentation with Pixel Embeddings
Wolny, Adrian
Yu, Qin
Pape, Constantin
Kreshuk, Anna
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4392 - 4401
[8] ObjectAug: Object-level Data Augmentation for Semantic Image Segmentation
Zhang, Jiawei
Zhang, Yanchun
Xu, Xiaowei
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[9] Fusion of Local Appearance with Stereo Depth for Object Tracking
Tang, Feng
Harville, Michael
Tao, Hai
Robinson, Ian N.
2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 651 - +
[10] Object-Level Targeted Selection via Deep Template Matching
Kothawade, Suraj
Roy, Donna
Fenzi, Michele
Haussmann, Elmar
Alvarez, Jose M.
Angerer, Christoph
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 766 - 771
