A Survey on Deep Learning Technique for Video Segmentation

被引:86
|
作者
Zhou, Tianfei [1 ]
Porikli, Fatih [2 ]
Crandall, David J. [3 ]
Van Gool, Luc [1 ]
Wang, Wenguan [4 ]
机构
[1] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[2] Australian Natl Univ, Sch Comp Sci, Canberra, ACT 2601, Australia
[3] Indiana Univ, Luddy Sch Informat Comp & Engn, Bloomington, IN 47405 USA
[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Ultimo, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Object segmentation; Automobiles; Semantic segmentation; Task analysis; Motion segmentation; Deep learning; Roads; Video segmentation; video object segmentation; video semantic segmentation; deep learning; OBJECT SEGMENTATION; TRACKING; IMAGE; AGGREGATION; NETWORKS;
D O I
10.1109/TPAMI.2022.3225573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video segmentation-partitioning video frames into multiple segments or objects-plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.
引用
收藏
页码:7099 / 7122
页数:24
相关论文
共 50 条
  • [31] A SURVEY ON VIDEO FACE RECOGNITION USING DEEP LEARNING
    Mustapha, Muhammad Firdaus
    Mohamad, Nur Maisarah
    Hamid, Siti Haslini A. B.
    Malik, Mohd Azry Abdul
    Noor, Mohd Rahimie M. D.
    JOURNAL OF QUALITY MEASUREMENT AND ANALYSIS, 2022, 18 (01): : 49 - 62
  • [32] A Survey of Deep Learning Video Super-Resolution
    Baniya, Arbind Agrahari
    Lee, Tsz-Kwan
    Eklund, Peter W.
    Aryal, Sunil
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2655 - 2676
  • [33] Video restoration based on deep learning: a comprehensive survey
    Claudio Rota
    Marco Buzzelli
    Simone Bianco
    Raimondo Schettini
    Artificial Intelligence Review, 2023, 56 : 5317 - 5364
  • [34] Video restoration based on deep learning: a comprehensive survey
    Rota, Claudio
    Buzzelli, Marco
    Bianco, Simone
    Schettini, Raimondo
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (06) : 5317 - 5364
  • [35] Video description: A comprehensive survey of deep learning approaches
    Ghazala Rafiq
    Muhammad Rafiq
    Gyu Sang Choi
    Artificial Intelligence Review, 2023, 56 : 13293 - 13372
  • [36] A Survey on Image Semantic Segmentation Using Deep Learning Techniques
    Cheng, Jieren
    Li, Hua
    Li, Dengbo
    Hua, Shuai
    Sheng, Victor S.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1941 - 1957
  • [37] A Survey of Research Progresses on Instance Segmentation Based on Deep Learning
    Fu, Cebin
    Tang, Xiangyan
    Yang, Yue
    Ruan, Chengchun
    Li, Binbin
    BIG DATA AND SECURITY, ICBDS 2023, PT I, 2024, 2099 : 138 - 151
  • [38] Heavy and Lightweight Deep Learning Models for Semantic Segmentation: A Survey
    Carunta, Cristina
    Carunta, Alina
    Popa, Calin-Adrian
    IEEE ACCESS, 2025, 13 : 17745 - 17765
  • [39] A comparative survey on SAR image segmentation using deep learning
    Jane, Ohtae
    Jo, Sangho
    Kim, Sungho
    2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1292 - 1296
  • [40] A Survey of Efficient Deep Learning Models for Moving Object Segmentation
    Hou, Bingxin
    Liu, Ying
    Ling, Nam
    Ren, Yongxiong
    Liu, Lingzhi
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (01)