Temporal stabilization of video object segmentation for 3D-TV applications

Cited by: 4
Authors
Erdem, CE
Ernst, F
Redert, A
Hendriks, E
Affiliations
[1] TEKSEB, MAM, TUBITAK, Momentum Digital Media Technol,Res Dept, Gebze, Kocaeli, Turkey
[2] Philips Res Labs, NL-5656 AA Eindhoven, Netherlands
[3] Delft Univ Technol, Dept Elect Engn Math & Comp Sci, Informat & Commun Theory Grp, NL-2628 CD Delft, Netherlands
Keywords
object segmentation; object tracking; temporal stabilization; 3D TV; curve evolution; snakes
DOI
10.1016/j.image.2004.10.005
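Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract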
Our aim is to insert depth information into an existing 2D video sequence to provide content for 3D-TV applications, which we try to achieve through segmentation of the objects in the given 2D video sequence. To this effect, we present a method for temporal stabilization of video object segmentation algorithms for 3D-TV applications. First, two quantitative measures to evaluate temporal stability without ground truth are discussed. Then, a pseudo-3D curve evolution method, which spatio-temporally stabilizes the estimated segmentation of a video object, is introduced. Temporal stability is achieved by redistributing existing object segmentation errors such that they will be less disturbing when the scene is rendered and viewed in 3D. Our starting point is the hypothesis that if making segmentation errors is inevitable, these errors should be made in a temporally consistent way for 3D-TV applications. This hypothesis is supported by the experiments, which show that there is significant improvement in segmentation quality both in terms of the objective quantitative measures and in terms of the viewing comfort in subjective perceptual tests. Therefore, it is possible to increase the perceptual object segmentation quality without increasing the actual segmentation accuracy. © 2004 Elsevier B.V. All rights reserved.
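The abstract names two ground-truth-free stability measures and a pseudo-3D curve evolution but gives no formulas for them. The sketch below is therefore only an illustration of the general idea, not the authors' method: `temporal_instability` is an assumed proxy (the fraction of pixels that change label between consecutive masks), and `temporal_majority` is a simple sliding-window vote swapped in for the curve evolution. Both function names and the `radius` parameter are hypothetical.

```python
import numpy as np


def temporal_instability(masks):
    """Assumed proxy, not the paper's measure: mean over consecutive frame
    pairs of (pixels whose label changed) / (pixels in either mask)."""
    masks = np.asarray(masks, dtype=bool)
    if masks.shape[0] < 2:
        return 0.0
    scores = []
    for prev, curr in zip(masks[:-1], masks[1:]):
        union = np.logical_or(prev, curr).sum()
        changed = np.logical_xor(prev, curr).sum()
        scores.append(changed / union if union else 0.0)
    return float(np.mean(scores))


def temporal_majority(masks, radius=1):
    """Stand-in for spatio-temporal stabilization: each pixel takes the
    majority label over frames t - radius .. t + radius (clipped at the ends)."""
    masks = np.asarray(masks, dtype=bool)
    out = np.empty_like(masks)
    n = masks.shape[0]
    for t in range(n):
        lo, hi = max(0, t - radius), min(n, t + radius + 1)
        votes = masks[lo:hi].sum(axis=0)
        out[t] = votes * 2 > (hi - lo)  # strict majority of the window
    return out


if __name__ == "__main__":
    # Six 8x8 frames: a static 4x4 object plus a one-frame glitch pixel.
    frames = np.zeros((6, 8, 8), dtype=bool)
    frames[:, 2:6, 2:6] = True
    frames[3, 0, 0] = True
    smoothed = temporal_majority(frames)
    print("raw instability:     ", temporal_instability(frames))
    print("smoothed instability:", temporal_instability(smoothed))
```

The vote removes the single-frame glitch without consulting any ground truth, which mirrors the abstract's point that perceived quality can improve through temporal consistency alone. A real boundary-based method would act on the object contour rather than on independent pixels.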
Pages: 151-167
Page count: 17
Related papers
50 items in total
  • [31] Video Object Segmentation without Temporal Information
    Maninis, Kevis-Kokitsi
    Caelles, Sergi
    Chen, Yuhua
    Pont-Tuset, Jordi
    Leal-Taixe, Laura
    Cremers, Daniel
    Van Gool, Luc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (06) : 1515 - 1530
  • [32] Video Object Segmentation with Weakly Temporal Information
    Zhang, Yikun
    Yao, Rui
    Jiang, Qingnan
    Zhang, Changbin
    Wang, Shi
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (03) : 1434 - 1449
  • [33] Depth Map Post-Processing for 3D-TV
    Gangwal, Om Prakash
    Berretty, Robert-Paul
    2009 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2009, : 283 - +
  • [34] On 3D-TV production: Stereoscopic 3D production; latest topics
    Ono H.
    Naito K.
    Kojima Y.
    Otsuka T.
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2010, 64 (11) : 1553 - 1556
  • [35] Video Object Segmentation with 3D Convolution Network
    Tang, Huiyun
    Tao, Pin
    Ma, Rui
    Shi, Yuanchun
    ICCCV 2019: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON CONTROL AND COMPUTER VISION, 2019, : 28 - 32
  • [36] 3D-TV Production From Conventional Cameras for Sports Broadcast
    Hilton, Adrian
    Guillemaut, Jean-Yves
    Kilner, Joe
    Grau, Oliver
    Thomas, Graham
    IEEE TRANSACTIONS ON BROADCASTING, 2011, 57 (02) : 462 - 476
  • [37] A novel coding scheme for full parallax 3D-TV pictures
    Forman, MC
    Aggoun, A
    McCormick, M
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 2945 - 2947
  • [38] Consideration about HMD-type holography 3D-TV
    Sato, K
    Takano, K
    THREE-DIMENSIONAL TV, VIDEO, AND DISPLAYS III, 2004, 5599 : 123 - 134
  • [39] Compression of full parallax integral 3D-TV image data
    Forman, MC
    Aggoun, A
    STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS IV, 1997, 3012 : 222 - 226
  • [40] Temporal Collection and Distribution for Referring Video Object Segmentation
    Tang, Jiajin
    Zheng, Ge
    Yang, Sibei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15420 - 15430