Video Polyp Segmentation: A Deep Learning Perspective

被引:0
|
作者
Ge-Peng Ji
Guobao Xiao
Yu-Cheng Chou
Deng-Ping Fan
Kai Zhao
Geng Chen
Luc Van Gool
机构
[1] Australian National University,Research School of Engineering
[2] Minjiang University,College of Computer and Control Engineering
[3] Johns Hopkins University,Department of Computer Science
[4] ETH Zürich,Computer Vision Laboratory
[5] University of California,Department of Radiological Sciences
[6] Northwestern Polytechnical University,School of Computer Science and Engineering
来源
Machine Intelligence Research | 2022年 / 19卷
关键词
Video polyp segmentation (VPS); dataset; self-attention; colonoscopy; abdomen;
D O I
暂无
中图分类号
学科分类号
摘要
We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era. Over the years, developments in VPS are not moving forward with ease due to the lack of a large-scale dataset with fine-grained segmentation annotations. To address this issue, we first introduce a high-quality frame-by-frame annotated VPS dataset, named SUN-SEG, which contains 158 690 colonoscopy video frames from the well-known SUN-database. We provide additional annotation covering diverse types, i.e., attribute, object mask, boundary, scribble, and polygon. Second, we design a simple but efficient baseline, named PNS+, which consists of a global encoder, a local encoder, and normalized self-attention (NS) blocks. The global and local encoders receive an anchor frame and multiple successive frames to extract long-term and short-term spatial-temporal representations, which are then progressively refined by two NS blocks. Extensive experiments show that PNS+ achieves the best performance and real-time inference speed (170 fps), making it a promising solution for the VPS task. Third, we extensively evaluate 13 representative polyp/object segmentation models on our SUN-SEG dataset and provide attribute-based comparisons. Finally, we discuss several open issues and suggest possible research directions for the VPS community. Our project and dataset are publicly available at https://github.com/GewelsJI/VPS.
引用
收藏
页码:531 / 549
页数:18
相关论文
共 50 条
  • [21] Application and Prospect of Deep Learning in Video Object Segmentation
    Chen J.
    Chen Y.-S.
    Li W.-H.
    Tian Y.
    Liu Z.
    He Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (03): : 609 - 631
  • [22] Deep Reinforcement Learning for Object Segmentation in Video Sequences
    Sahba, Farhang
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 857 - 860
  • [23] Unsupervised Video Object Segmentation for Deep Reinforcement Learning
    Goel, Vik
    Weng, Jameson
    Poupart, Pascal
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] DilatedSegNet: A Deep Dilated Segmentation Network for Polyp Segmentation
    Tomar, Nikhil Kumar
    Jha, Debesh
    Bagci, Ulas
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 334 - 344
  • [25] Lightweight Deep Learning Model for Real-Time Colorectal Polyp Segmentation
    Jeong, Seung-Min
    Lee, Seung-Gun
    Seok, Chae-Lin
    Lee, Eui-Chul
    Lee, Jun-Young
    ELECTRONICS, 2023, 12 (09)
  • [26] Deep Vision: Lane Detection in ITS: A Deep Learning Segmentation Perspective
    Santhiya, P.
    Jebadurai, Immanuel JohnRaja
    Paulraj, Getzi Jeba Leelipushpam
    Jenefa, A.
    Karan, S. Kiruba
    Naveen, Edward, V
    2024 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS, ICICI 2024, 2024, : 21 - 26
  • [27] A survey on deep learning techniques for image and video semantic segmentation
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Villena-Martinez, Victor
    Martinez-Gonzalez, Pablo
    Garcia-Rodriguez, Jose
    APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65
  • [28] Temporal video scene segmentation using deep-learning
    Tiago Henrique Trojahn
    Rudinei Goularte
    Multimedia Tools and Applications, 2021, 80 : 17487 - 17513
  • [29] Temporal video scene segmentation using deep-learning
    Trojahn, Tiago Henrique
    Goularte, Rudinei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 17487 - 17513
  • [30] Segmentation of Motion Objects in Video Frames using Deep Learning
    Jiang, Feng
    Liu, Jiao
    Tian, Jiya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 11 - 20