Spatiotemporal Semantic Video Segmentation

被引：0

作者：

Galmar, E. ^{[1
]}

Athanasiadis, Th ^{[1
]}

Huet, B. ^{[2
]}

Avrithis, Y. ^{[2
]}

机构：

[1] Eurecom, Dept Multimedia, Sophia Antipolis, France

[2] NTUA, Image Video & Multimedia Syst Lab, Athens, Greece

来源：

2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2 | 2008年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a framework to extend semantic labeling of images to video shot sequences and achieve efficient and semantic-aware spatiotemporal video segmentation. This task faces two major challenges, namely the temporal variations within a video sequence which affect image segmentation and labeling, and the computational cost of region labeling. Guided by these limitations, we design a method where spatiotemporal segmentation and object labeling are coupled to achieve semantic annotation of video shots. An internal graph structure that describes both visual and semantic properties of image and video regions is adopted. The process of spatiotemporal semantic segmentation is subdivided in two stages: Firstly, the video shot is split into small block of frames. Spatiotemporal regions (volumes) are extracted and labeled individually within each block. Then, we iteratively merge consecutive blocks by a matching procedure which considers both semantic and visual properties. Results on real video sequences show the potential of our approach.

引用

页码：578 / +

页数：2

共 50 条

[41] Integration of audio and video semantic features for news video scene segmentation
Xu, J
Liu, HB
Zhou, DR
VISUALIZATION AND OPTIMIZATION TECHNIQUES, 2001, 4553 : 227 - 232
[42] Spatiotemporal MRF approach to video segmentation:: Application to motion detection and lip segmentation
Luthon, F
Caplier, A
Liévin, M
SIGNAL PROCESSING, 1999, 76 (01) : 61 - 80
[43] Spatiotemporal MRF approach to video segmentation: Application to motion detection and lip segmentation
Lab. des Images et des Signaux, Inst. Natl. Polytech. de Grenoble, INPG, 46 avenue Félix-Viallet, 38031 Grenoble Cedex, France
Signal Process, 1 (61-80):
[44] Electrophoretic video display based on image semantic segmentation
Zhang, Heng
Li, Shi-Xiao
Chen, Jian-Wen
Wang, Zi-Yang
Bo, Xiao
Gao, Rui-Si
Bai, Peng-Fei
Zhou, Guo-Fu
JOURNAL OF THE SOCIETY FOR INFORMATION DISPLAY, 2025, 33 (01) : 34 - 45
[45] Semantic Single Video Segmentation with Robust Graph Representation
Zhao, Handong
Fu, Yun
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 2219 - 2225
[46] Local Memory Attention for Fast Video Semantic Segmentation
Paul, Matthieu
Danelljan, Martin
Van Gool, Luc
Timofte, Radu
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1102 - 1109
[47] Architecture Search of Dynamic Cells for Semantic Video Segmentation
Nekrasov, Vladimir
Chen, Hao
Shen, Chunhua
Reid, Ian
2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1959 - 1968
[48] Dual Correlation Network for Efficient Video Semantic Segmentation
An, Shumin
Liao, Qingmin
Lu, Zongqing
Xue, Jing-Hao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1572 - 1585
[49] A Novel Scheme for Video Scenes Segmentation and Semantic Representation
Zhu, Songhao
Liu, Yuncai
2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1289 - 1292
[50] Attention-Guided Network for Semantic Video Segmentation
Li, Jiangyun
Zhao, Yikai
Fu, Jun
Wu, Jiajia
Liu, Jing
IEEE ACCESS, 2019, 7 : 140680 - 140689

← 1 2 3 4 5 →