A Survey on Deep Learning Technique for Video Segmentation

被引:86
|
作者
Zhou, Tianfei [1 ]
Porikli, Fatih [2 ]
Crandall, David J. [3 ]
Van Gool, Luc [1 ]
Wang, Wenguan [4 ]
机构
[1] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[2] Australian Natl Univ, Sch Comp Sci, Canberra, ACT 2601, Australia
[3] Indiana Univ, Luddy Sch Informat Comp & Engn, Bloomington, IN 47405 USA
[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Ultimo, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Object segmentation; Automobiles; Semantic segmentation; Task analysis; Motion segmentation; Deep learning; Roads; Video segmentation; video object segmentation; video semantic segmentation; deep learning; OBJECT SEGMENTATION; TRACKING; IMAGE; AGGREGATION; NETWORKS;
D O I
10.1109/TPAMI.2022.3225573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video segmentation-partitioning video frames into multiple segments or objects-plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.
引用
收藏
页码:7099 / 7122
页数:24
相关论文
共 50 条
  • [1] A survey on deep learning techniques for image and video semantic segmentation
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Villena-Martinez, Victor
    Martinez-Gonzalez, Pablo
    Garcia-Rodriguez, Jose
    APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65
  • [2] Deep video representation learning: a survey
    Ravanbakhsh, Elham
    Liang, Yongqing
    Ramanujam, J.
    Li, Xin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (20) : 59195 - 59225
  • [3] Video Polyp Segmentation: A Deep Learning Perspective
    Ji, Ge-Peng
    Xiao, Guobao
    Chou, Yu-Cheng
    Fan, Deng-Ping
    Zhao, Kai
    Chen, Geng
    Van Gool, Luc
    MACHINE INTELLIGENCE RESEARCH, 2022, 19 (06) : 531 - 549
  • [4] Video Polyp Segmentation: A Deep Learning Perspective
    Ge-Peng Ji
    Guobao Xiao
    Yu-Cheng Chou
    Deng-Ping Fan
    Kai Zhao
    Geng Chen
    Luc Van Gool
    Machine Intelligence Research, 2022, 19 : 531 - 549
  • [5] Video Polyp Segmentation:A Deep Learning Perspective
    Ge-Peng Ji
    Guobao Xiao
    Yu-Cheng Chou
    Deng-Ping Fan
    Kai Zhao
    Geng Chen
    Luc Van Gool
    Machine Intelligence Research, 2022, 19 (06) : 531 - 549
  • [6] Deep learning for video object segmentation: a review
    Mingqi Gao
    Feng Zheng
    James J. Q. Yu
    Caifeng Shan
    Guiguang Ding
    Jungong Han
    Artificial Intelligence Review, 2023, 56 : 457 - 531
  • [7] Human segmentation in surveillance video with deep learning
    Monica Gruosso
    Nicola Capece
    Ugo Erra
    Multimedia Tools and Applications, 2021, 80 : 1175 - 1199
  • [8] Deep learning for video object segmentation: a review
    Gao, Mingqi
    Zheng, Feng
    Yu, James J. Q.
    Shan, Caifeng
    Ding, Guiguang
    Han, Jungong
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (01) : 457 - 531
  • [9] Human segmentation in surveillance video with deep learning
    Gruosso, Monica
    Capece, Nicola
    Erra, Ugo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (01) : 1175 - 1199
  • [10] A frequency-driven deep learning technique for bird segmentation and detection from RGB video
    Suthaharan, Shan
    APPLICATIONS OF MACHINE LEARNING 2023, 2023, 12675