A Survey on Deep Learning Technique for Video Segmentation

Cited by: 86

Authors:

Zhou, Tianfei [1]

Porikli, Fatih [2]

Crandall, David J. [3]

Van Gool, Luc [1]

Wang, Wenguan [4]

Affiliations:

[1] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland

[2] Australian Natl Univ, Sch Comp Sci, Canberra, ACT 2601, Australia

[3] Indiana Univ, Luddy Sch Informat Comp & Engn, Bloomington, IN 47405 USA

[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Ultimo, NSW 2007, Australia

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023, Vol. 45, No. 06

Funding:

Australian Research Council;

Keywords:

Object segmentation; Automobiles; Semantic segmentation; Task analysis; Motion segmentation; Deep learning; Roads; Video segmentation; video object segmentation; video semantic segmentation; deep learning; OBJECT SEGMENTATION; TRACKING; IMAGE; AGGREGATION; NETWORKS;

DOI:

10.1109/TPAMI.2022.3225573

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Video segmentation, i.e., partitioning video frames into multiple segments or objects, plays a critical role in a broad range of practical applications, from enhancing visual effects in movies, to understanding scenes in autonomous driving, to creating virtual backgrounds in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep-learning-based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research, generic object segmentation (of unknown categories) in videos and video semantic segmentation, by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast-advancing field: https://github.com/tfzhou/VS-Survey.

Pages: 7099-7122

Page count: 24
