Joint Deep and Depth for Object-Level Segmentation and Stereo Tracking in Crowds

被引:14
|
作者
Li, Jing [1 ]
Wei, Lisong [1 ]
Zhang, Fangbing [2 ]
Yang, Tao [2 ]
Lu, Zhaoyang [1 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiple object tracking; object-level segmentation; stereo vision; severe occlusion; PARTICLE PHD FILTER; MULTIPLE;
D O I
10.1109/TMM.2019.2908350
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tracking multiple people in crowds is a fundamental and essential task in the multimedia field. It is often hindered by difficulties, such as dynamic occlusion between objects, cluttered background, and abrupt illumination changes. To respond to this need, in this paper, we combine deep and depth to build a stereo tracking system for crowds. The core of the system is the fusion of the advantages of deep learning and depth information, which is exploited to achieve object segmentation and improve the multiobject tracking performance in severe occlusion. More specifically, first, to obtain more accurate detection observations in the tracking system, we present a novel object-level segmentation method. This method combines the effective detection results of deep learning with depth information to obtain precise object segmentation results. Then, we integrate the segmentation results and three-dimensional (3-D) information to extract 2-D and 3-D characteristics to represent the target, and design three similarity models to realize a stereo tracking method through data association in crowds. Finally, we build a diverse stereo dataset including various challenging indoor and outdoor scenes. The comprehensive experiments verify the effective and robust tracking performance of our system in various scenarios, and the system has rich output results including segmentation results, target distance, and tracking results. Moreover, the qualitative and quantitative comparison results show that the proposed algorithm not only has good object segmentation performance but also improves the tracking performance of completely and partially occluded objects, which is superior to the tested state-of-the-art tracking approaches.
引用
收藏
页码:2531 / 2544
页数:14
相关论文
共 50 条
  • [1] Object Detection and Tracking Under Occlusion for Object-Level RGB-D Video Segmentation
    Xie, Qian
    Remil, Oussama
    Guo, Yanwen
    Wang, Meng
    Wei, Mingqiang
    Wang, Jun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (03) : 580 - 592
  • [2] Object Stereo-Joint Stereo Matching and Object Segmentation
    Bleyer, Michael
    Rother, Carsten
    Kohli, Pushmeet
    Scharstein, Daniel
    Sinha, Sudipta
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [3] Object-Level Image Segmentation Using Low Level Cues
    Zhu, Hongyuan
    Zheng, Jianmin
    Cai, Jianfei
    Thalmann, Nadia M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (10) : 4019 - 4027
  • [4] Joint Object Segmentation and Depth Upsampling
    Huang, Wenqi
    Gong, Xiaojin
    Yang, Michael Ying
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (02) : 192 - 196
  • [5] Multiclass Semantic Video Segmentation with Object-level Active Inference
    Liu, Buyu
    He, Xuming
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4286 - 4294
  • [6] Object-Level Segmentation of Indoor Point Clouds by the Convexity of Adjacent Object Regions
    Luo, Nan
    Wang, Quan
    Wei, Qi
    Jing, Chuan
    IEEE ACCESS, 2019, 7 : 171934 - 171949
  • [7] Sparse Object-level Supervision for Instance Segmentation with Pixel Embeddings
    Wolny, Adrian
    Yu, Qin
    Pape, Constantin
    Kreshuk, Anna
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4392 - 4401
  • [8] ObjectAug: Object-level Data Augmentation for Semantic Image Segmentation
    Zhang, Jiawei
    Zhang, Yanchun
    Xu, Xiaowei
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Fusion of Local Appearance with Stereo Depth for Object Tracking
    Tang, Feng
    Harville, Michael
    Tao, Hai
    Robinson, Ian N.
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 651 - +
  • [10] Object-Level Targeted Selection via Deep Template Matching
    Kothawade, Suraj
    Roy, Donna
    Fenzi, Michele
    Haussmann, Elmar
    Alvarez, Jose M.
    Angerer, Christoph
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 766 - 771