Spatiotemporal Semantic Video Segmentation

被引:0
|
作者
Galmar, E. [1 ]
Athanasiadis, Th [1 ]
Huet, B. [2 ]
Avrithis, Y. [2 ]
机构
[1] Eurecom, Dept Multimedia, Sophia Antipolis, France
[2] NTUA, Image Video & Multimedia Syst Lab, Athens, Greece
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a framework to extend semantic labeling of images to video shot sequences and achieve efficient and semantic-aware spatiotemporal video segmentation. This task faces two major challenges, namely the temporal variations within a video sequence which affect image segmentation and labeling, and the computational cost of region labeling. Guided by these limitations, we design a method where spatiotemporal segmentation and object labeling are coupled to achieve semantic annotation of video shots. An internal graph structure that describes both visual and semantic properties of image and video regions is adopted. The process of spatiotemporal semantic segmentation is subdivided in two stages: Firstly, the video shot is split into small block of frames. Spatiotemporal regions (volumes) are extracted and labeled individually within each block. Then, we iteratively merge consecutive blocks by a matching procedure which considers both semantic and visual properties. Results on real video sequences show the potential of our approach.
引用
收藏
页码:578 / +
页数:2
相关论文
共 50 条
  • [41] Integration of audio and video semantic features for news video scene segmentation
    Xu, J
    Liu, HB
    Zhou, DR
    VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 227 - 232
  • [42] Spatiotemporal MRF approach to video segmentation:: Application to motion detection and lip segmentation
    Luthon, F
    Caplier, A
    Liévin, M
    SIGNAL PROCESSING, 1999, 76 (01) : 61 - 80
  • [43] Spatiotemporal MRF approach to video segmentation: Application to motion detection and lip segmentation
    Lab. des Images et des Signaux, Inst. Natl. Polytech. de Grenoble, INPG, 46 avenue Félix-Viallet, 38031 Grenoble Cedex, France
    Signal Process, 1 (61-80):
  • [44] Electrophoretic video display based on image semantic segmentation
    Zhang, Heng
    Li, Shi-Xiao
    Chen, Jian-Wen
    Wang, Zi-Yang
    Bo, Xiao
    Gao, Rui-Si
    Bai, Peng-Fei
    Zhou, Guo-Fu
    JOURNAL OF THE SOCIETY FOR INFORMATION DISPLAY, 2025, 33 (01) : 34 - 45
  • [45] Semantic Single Video Segmentation with Robust Graph Representation
    Zhao, Handong
    Fu, Yun
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 2219 - 2225
  • [46] Local Memory Attention for Fast Video Semantic Segmentation
    Paul, Matthieu
    Danelljan, Martin
    Van Gool, Luc
    Timofte, Radu
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1102 - 1109
  • [47] Architecture Search of Dynamic Cells for Semantic Video Segmentation
    Nekrasov, Vladimir
    Chen, Hao
    Shen, Chunhua
    Reid, Ian
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1959 - 1968
  • [48] Dual Correlation Network for Efficient Video Semantic Segmentation
    An, Shumin
    Liao, Qingmin
    Lu, Zongqing
    Xue, Jing-Hao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1572 - 1585
  • [49] A Novel Scheme for Video Scenes Segmentation and Semantic Representation
    Zhu, Songhao
    Liu, Yuncai
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1289 - 1292
  • [50] Attention-Guided Network for Semantic Video Segmentation
    Li, Jiangyun
    Zhao, Yikai
    Fu, Jun
    Wu, Jiajia
    Liu, Jing
    IEEE ACCESS, 2019, 7 : 140680 - 140689