Spatiotemporal Semantic Video Segmentation

Cited by: 0
Authors
Galmar, E. [1]
Athanasiadis, Th. [1]
Huet, B. [2]
Avrithis, Y. [2]
Affiliations
[1] Eurecom, Dept Multimedia, Sophia Antipolis, France
[2] NTUA, Image Video & Multimedia Syst Lab, Athens, Greece
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a framework that extends semantic labeling of images to video shot sequences and achieves efficient, semantic-aware spatiotemporal video segmentation. This task faces two major challenges: the temporal variations within a video sequence, which affect image segmentation and labeling, and the computational cost of region labeling. Guided by these constraints, we design a method in which spatiotemporal segmentation and object labeling are coupled to achieve semantic annotation of video shots. An internal graph structure that describes both visual and semantic properties of image and video regions is adopted. The process of spatiotemporal semantic segmentation is divided into two stages. First, the video shot is split into small blocks of frames, and spatiotemporal regions (volumes) are extracted and labeled individually within each block. Second, consecutive blocks are merged iteratively by a matching procedure that considers both semantic and visual properties. Results on real video sequences show the potential of our approach.
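The two-stage process described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Volume` class, the mean-color feature, the equal weighting of semantic and visual terms, and the threshold value are all assumptions introduced here for illustration. The sketch only shows the general shape of splitting a shot into blocks and then merging consecutive blocks by a combined semantic and visual match.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple


@dataclass
class Volume:
    """A labeled spatiotemporal region (hypothetical representation)."""
    label: str                          # semantic label assigned within a block
    color: Tuple[float, float, float]   # mean RGB in [0, 1]; stand-in visual feature
    frames: Tuple[int, int]             # inclusive (first, last) frame indices


def split_into_blocks(num_frames: int, block_size: int) -> List[Tuple[int, int]]:
    """Stage 1: split a shot into consecutive inclusive frame ranges."""
    return [(s, min(s + block_size, num_frames) - 1)
            for s in range(0, num_frames, block_size)]


def match_score(a: Volume, b: Volume, w_sem: float = 0.5) -> float:
    """Weighted mix of label agreement and color similarity, in [0, 1]."""
    semantic = 1.0 if a.label == b.label else 0.0
    dist = sum((x - y) ** 2 for x, y in zip(a.color, b.color)) ** 0.5
    visual = 1.0 - dist / 3 ** 0.5      # sqrt(3) is the largest possible RGB distance
    return w_sem * semantic + (1.0 - w_sem) * visual


def merge_blocks(blocks: List[List[Volume]], threshold: float = 0.8) -> List[Volume]:
    """Stage 2: fold each block into the running result, one block at a time."""
    merged = [replace(v) for v in blocks[0]] if blocks else []
    for block in blocks[1:]:
        for vol in block:
            # Only volumes that end right before this one starts can be extended.
            candidates = [m for m in merged if m.frames[1] + 1 == vol.frames[0]]
            best = max(candidates, key=lambda m: match_score(m, vol), default=None)
            if best is not None and match_score(best, vol) >= threshold:
                best.frames = (best.frames[0], vol.frames[1])  # extend in time
            else:
                merged.append(replace(vol))                    # start a new volume
    return merged


# Toy example: a 8-frame shot split into two blocks of 4 frames.
block_a = [Volume("sky", (0.2, 0.4, 0.9), (0, 3)),
           Volume("grass", (0.1, 0.7, 0.1), (0, 3))]
block_b = [Volume("sky", (0.25, 0.4, 0.85), (4, 7)),
           Volume("person", (0.8, 0.6, 0.5), (4, 7))]
result = merge_blocks([block_a, block_b])
```

In this toy input the two "sky" volumes agree in label and are close in color, so they are joined into one volume spanning both blocks, while "person" has no good match and starts a new volume. A real system would use richer visual descriptors and graph-based matching rather than a single mean color.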
Pages: 578 / +
Page count: 2