Fully automatic person segmentation in unconstrained video using spatio-temporal conditional random fields

被引:6
|
作者
Bhole, Chetan [1 ]
Pal, Christopher [2 ]
机构
[1] Univ Rochester, Rochester, NY 14620 USA
[2] Univ Montreal, Montreal, PQ, Canada
关键词
Person segmentation; Video segmentation; Conditional random field; Optical flow; Fully automatic; POSE ESTIMATION;
D O I
10.1016/j.imavis.2016.04.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The segmentation of objects and people in particular is an important problem in computer vision. In this paper, we focus on automatically segmenting a person from challenging video sequences in which we place no constraint on camera viewpoint, camera motion or the movements of a person in the scene. Our approach uses the most confident predictions from a pose detector as a form of anchor or keyframe stick figure prediction which helps guide the segmentation of other more challenging frames in the video. Since even state of the art pose detectors are unreliable on many frames especially given that we are interested in segmentations with no camera or motion constraints only the poses or stick figure predictions for frames with the highest confidence in a localized temporal region anchor further processing. The stick figure predictions within confident keyframes are used to extract color, position and optical flow features. Multiple conditional random fields (CRFs) are used to process blocks of video in batches, using a two dimensional CRF for detailed keyframe segmentation as well as 3D CRFs for propagating segmentations to the entire sequence of frames belonging to batches. Location information derived from the pose is also used to refine the results. Importantly, no hand labeled training data is required by our method. We discuss the use of a continuity method that reuses learnt parameters between batches of frames and show how pose predictions can also be improved by our model. We provide an extensive evaluation of our approach, comparing it with a variety of alternative grab cut based methods and a prior state of the art method. We also release our evaluation data to the community to facilitate further experiments. We find that our approach yields state of the art qualitative and quantitative performance compared to prior work and more heuristic alternative approaches. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:58 / 68
页数:11
相关论文
共 50 条
  • [31] ImaGINator: Conditional Spatio-Temporal GAN for Video Generation
    Wang, Yaohui
    Bilinski, Piotr
    Bremond, Francois
    Dantcheva, Antitza
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1149 - 1158
  • [32] SPATIO-TEMPORAL ALTIMETER WAVEFORM RETRACKING VIA SPARSE REPRESENTATION AND CONDITIONAL RANDOM FIELDS
    Roscher, Ribana
    Uebbing, Bernd
    Kusche, Juergen
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 1234 - 1237
  • [33] Automatic video background replacement using shape-based probabilistic spatio-temporal object segmentation
    Ahmed, Rakib
    Karmakar, Gour C.
    Dooley, Laurence S.
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 914 - +
  • [34] SPATIO-TEMPORAL VISUAL RECEPTIVE-FIELDS AS REVEALED BY SPATIO-TEMPORAL RANDOM NOISE
    HIDA, E
    NAKA, K
    ZEITSCHRIFT FUR NATURFORSCHUNG C-A JOURNAL OF BIOSCIENCES, 1982, 37 (10): : 1048 - 1049
  • [35] STREAMING SPATIO-TEMPORAL VIDEO SEGMENTATION USING GAUSSIAN MIXTURE MODEL
    Mukherjee, Dibyendu
    Wu, Q. M. Jonathan
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4388 - 4392
  • [36] Object-based video segmentation using spatio-temporal energy
    Bao, HQ
    Zhang, ZY
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1260 - 1263
  • [37] Adaptive video segmentation algorithm using hidden conditional random fields
    Chu, Yi-Ping
    Zhang, Yin
    Ye, Xiu-Zi
    Zhang, San-Yuan
    Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (12): : 1252 - 1258
  • [38] Illumination invariant segmentation of spatio-temporal images by spatio-temporal Markov random field model
    Kamijo, S
    Ikeuchi, K
    Sakauchi, M
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 617 - 622
  • [39] Spatio-temporal joint probability images for video segmentation
    Li, ZN
    Wei, J
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 295 - 298
  • [40] A Novel Spatio-Temporal Video Object Segmentation Algorithm
    Zhu, Shiping
    Xia, Xi
    Zhang, Qingrong
    Belloulata, Kamel
    2008 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-5, 2008, : 1916 - +