Spatiotemporal Semantic Video Segmentation

被引:0
|
作者
Galmar, E. [1 ]
Athanasiadis, Th [1 ]
Huet, B. [2 ]
Avrithis, Y. [2 ]
机构
[1] Eurecom, Dept Multimedia, Sophia Antipolis, France
[2] NTUA, Image Video & Multimedia Syst Lab, Athens, Greece
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a framework to extend semantic labeling of images to video shot sequences and achieve efficient and semantic-aware spatiotemporal video segmentation. This task faces two major challenges, namely the temporal variations within a video sequence which affect image segmentation and labeling, and the computational cost of region labeling. Guided by these limitations, we design a method where spatiotemporal segmentation and object labeling are coupled to achieve semantic annotation of video shots. An internal graph structure that describes both visual and semantic properties of image and video regions is adopted. The process of spatiotemporal semantic segmentation is subdivided in two stages: Firstly, the video shot is split into small block of frames. Spatiotemporal regions (volumes) are extracted and labeled individually within each block. Then, we iteratively merge consecutive blocks by a matching procedure which considers both semantic and visual properties. Results on real video sequences show the potential of our approach.
引用
收藏
页码:578 / +
页数:2
相关论文
共 50 条
  • [1] SPATIOTEMPORAL SEGMENTATION FOR STEREOSCOPIC VIDEO
    Zhao, Yu
    Liu, Yebin
    Dai, Qionghai
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,
  • [2] Spatiotemporal segmentation for compact video representation
    Fan, JP
    Yu, J
    Fujita, G
    Onoye, T
    Wu, L
    Shirakawa, I
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (06) : 553 - 566
  • [3] Spatiotemporal CNN for Video Object Segmentation
    Xu, Kai
    Wen, Longyin
    Li, Guorong
    Bo, Liefeng
    Huang, Qingming
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1379 - 1388
  • [4] Semantic Segmentation Facilitates Semantic Communication in Surveillance Video
    Ma, Wenbo
    Xie, Yu
    Wang, Congyan
    Zheng, Kaipeng
    Chen, Mingkai
    2024 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC, 2024,
  • [5] Clockwork Convnets for Video Semantic Segmentation
    Shelhamer, Evan
    Rakelly, Kate
    Hoffman, Judy
    Darrell, Trevor
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 852 - 868
  • [6] Deep Video Dehazing With Semantic Segmentation
    Ren, Wenqi
    Zhang, Jingang
    Xu, Xiangyu
    Ma, Lin
    Cao, Xiaochun
    Meng, Gaofeng
    Liu, Wei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1895 - 1908
  • [7] Semantic video scene segmentation and transfer
    Gritti, Tommaso
    Damkat, Chris
    Monaci, Gianluca
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 122 : 172 - 181
  • [8] A pothole video dataset for semantic segmentation
    Ihsan, Muhammad
    Amrizal, Muhammad Alfian
    Harjoko, Agus
    DATA IN BRIEF, 2024, 53
  • [9] Semantic segmentation and description for video transcoding
    Cavallaro, A
    Steiger, O
    Ebrahimi, T
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 597 - 600
  • [10] Learning Spatiotemporal Features for Video Semantic Segmentation Using 3D Convolutional Neural Networks
    Chen, Jiamin
    Wang, Mingchen
    Jiang, Shang
    Huang, Bin
    Sun, Hongbo
    2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 55 - 62