Spatiotemporal Semantic Video Segmentation

被引：0

作者：

Galmar, E. ^{[1
]}

Athanasiadis, Th ^{[1
]}

Huet, B. ^{[2
]}

Avrithis, Y. ^{[2
]}

机构：

[1] Eurecom, Dept Multimedia, Sophia Antipolis, France

[2] NTUA, Image Video & Multimedia Syst Lab, Athens, Greece

来源：

2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2 | 2008年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a framework to extend semantic labeling of images to video shot sequences and achieve efficient and semantic-aware spatiotemporal video segmentation. This task faces two major challenges, namely the temporal variations within a video sequence which affect image segmentation and labeling, and the computational cost of region labeling. Guided by these limitations, we design a method where spatiotemporal segmentation and object labeling are coupled to achieve semantic annotation of video shots. An internal graph structure that describes both visual and semantic properties of image and video regions is adopted. The process of spatiotemporal semantic segmentation is subdivided in two stages: Firstly, the video shot is split into small block of frames. Spatiotemporal regions (volumes) are extracted and labeled individually within each block. Then, we iteratively merge consecutive blocks by a matching procedure which considers both semantic and visual properties. Results on real video sequences show the potential of our approach.

引用

页码：578 / +

页数：2

共 50 条

[31] An Attention based Method for Video Semantic Segmentation
Huang, Yuan
Huang, Qian
Huang, Shuai
Li, Yanping
TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2020), 2020, 11519
[32] Temporal information integration for video semantic segmentation
Guarino, G.
Chateau, T.
Teuliere, C.
Antoine, V
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 8545 - 8551
[33] Discriminative Feature Learning for Video Semantic Segmentation
Zhang, Han
Jiang, Kai
Zhang, Yu
Li, Qing
Xia, Changqun
Chen, Xiaowu
2014 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV2014), 2014, : 321 - 326
[34] Efficient spatiotemporal segmentation and video object generation for highway surveillance video
Shi, R
Li, XF
Li, ZM
2002 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS AND WEST SINO EXPOSITION PROCEEDINGS, VOLS 1-4, 2002, : 580 - 584
[35] Semiautomatic segmentation and tracking of semantic video objects
Gu, C
Lee, MC
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (05) : 572 - 584
[36] Unsupervised video object segmentation by spatiotemporal graphical model
Lijun Guo
Tingting Cheng
Yuanjie Huang
Jieyu Zhao
Rong Zhang
Multimedia Tools and Applications, 2017, 76 : 1037 - 1053
[37] Unsupervised video object segmentation by spatiotemporal graphical model
Guo, Lijun
Cheng, Tingting
Huang, Yuanjie
Zhao, Jieyu
Zhang, Rong
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (01) : 1037 - 1053
[38] SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
Duke, Brendan
Ahmed, Abdalla
Wolf, Christian
Aarabi, Parham
Taylor, Graham W.
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5908 - 5917
[39] Video segmentation using spatiotemporal Markov random field
Hwang, SW
Kim, EY
Yun, TS
Kim, HJ
COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2000, : 349 - 352
[40] Selecting salient frames for spatiotemporal video modeling and segmentation
Song, Xiaomu
Fan, Guoliang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (12) : 3035 - 3046

← 1 2 3 4 5 →