A Survey on Deep Learning Technique for Video Segmentation

被引:86
|
作者
Zhou, Tianfei [1 ]
Porikli, Fatih [2 ]
Crandall, David J. [3 ]
Van Gool, Luc [1 ]
Wang, Wenguan [4 ]
机构
[1] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[2] Australian Natl Univ, Sch Comp Sci, Canberra, ACT 2601, Australia
[3] Indiana Univ, Luddy Sch Informat Comp & Engn, Bloomington, IN 47405 USA
[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Ultimo, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Object segmentation; Automobiles; Semantic segmentation; Task analysis; Motion segmentation; Deep learning; Roads; Video segmentation; video object segmentation; video semantic segmentation; deep learning; OBJECT SEGMENTATION; TRACKING; IMAGE; AGGREGATION; NETWORKS;
D O I
10.1109/TPAMI.2022.3225573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video segmentation-partitioning video frames into multiple segments or objects-plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.
引用
收藏
页码:7099 / 7122
页数:24
相关论文
共 50 条
  • [41] A survey on underwater coral image segmentation based on deep learning
    Li, Ming
    Zhang, Hanqi
    Gruen, Armin
    Li, Deren
    GEO-SPATIAL INFORMATION SCIENCE, 2024,
  • [42] A Survey on Medical Image Segmentation Based on Deep Learning Techniques
    Moorthy, Jayashree
    Gandhi, Usha Devi
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [43] Temporal video segmentation: A survey
    Koprinska, I
    Carrato, S
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2001, 16 (05) : 477 - 500
  • [44] Video Object Segmentation: A Survey
    Sasithradevi, A.
    Roomi, S. Mohamed Mansoor
    Mareeswari, M.
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES), 2016, : 656 - 660
  • [45] Assessment of deep learning technique for fully automated mandibular segmentation
    Yurdakurban, Ebru
    Sukut, Yagizalp
    Duran, Gokhan Serhat
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2025, 167 (02) : 242 - 249
  • [46] Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation
    Yin, Yingjie
    Xu, De
    Wang, Xingang
    Zhang, Lei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3884 - 3894
  • [47] Optimizing Deep Learning Based Semantic Video Segmentation on Embedded GPUs
    Baba, Filip
    Kenjic, Dusan
    Bjelica, Milan
    Kastelan, Ivan
    2019 IEEE 9TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE-BERLIN), 2019, : 6 - 8
  • [48] A new multimodal deep-learning model to video scene segmentation
    Trojahn, Tiago H.
    Kishi, Rodrigo M.
    Goularte, Rudinei
    WEBMEDIA'18: PROCEEDINGS OF THE 24TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, 2018, : 205 - 212
  • [49] Learning Deep Spatio-Temporal Dependence for Semantic Video Segmentation
    Qiu, Zhaofan
    Yao, Ting
    Mei, Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (04) : 939 - 949
  • [50] Deep Learning Technique for Human Parsing: A Survey and Outlook
    Yang, Lu
    Jia, Wenhe
    Li, Shan
    Song, Qing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3270 - 3301