Video Polyp Segmentation: A Deep Learning Perspective

被引:0
|
作者
Ge-Peng Ji
Guobao Xiao
Yu-Cheng Chou
Deng-Ping Fan
Kai Zhao
Geng Chen
Luc Van Gool
机构
[1] Australian National University,Research School of Engineering
[2] Minjiang University,College of Computer and Control Engineering
[3] Johns Hopkins University,Department of Computer Science
[4] ETH Zürich,Computer Vision Laboratory
[5] University of California,Department of Radiological Sciences
[6] Northwestern Polytechnical University,School of Computer Science and Engineering
来源
关键词
Video polyp segmentation (VPS); dataset; self-attention; colonoscopy; abdomen;
D O I
暂无
中图分类号
学科分类号
摘要
We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era. Over the years, developments in VPS are not moving forward with ease due to the lack of a large-scale dataset with fine-grained segmentation annotations. To address this issue, we first introduce a high-quality frame-by-frame annotated VPS dataset, named SUN-SEG, which contains 158 690 colonoscopy video frames from the well-known SUN-database. We provide additional annotation covering diverse types, i.e., attribute, object mask, boundary, scribble, and polygon. Second, we design a simple but efficient baseline, named PNS+, which consists of a global encoder, a local encoder, and normalized self-attention (NS) blocks. The global and local encoders receive an anchor frame and multiple successive frames to extract long-term and short-term spatial-temporal representations, which are then progressively refined by two NS blocks. Extensive experiments show that PNS+ achieves the best performance and real-time inference speed (170 fps), making it a promising solution for the VPS task. Third, we extensively evaluate 13 representative polyp/object segmentation models on our SUN-SEG dataset and provide attribute-based comparisons. Finally, we discuss several open issues and suggest possible research directions for the VPS community. Our project and dataset are publicly available at https://github.com/GewelsJI/VPS.
引用
收藏
页码:531 / 549
页数:18
相关论文
共 50 条
  • [1] Video Polyp Segmentation:A Deep Learning Perspective
    Ge-Peng Ji
    Guobao Xiao
    Yu-Cheng Chou
    Deng-Ping Fan
    Kai Zhao
    Geng Chen
    Luc Van Gool
    Machine Intelligence Research, 2022, 19 (06) : 531 - 549
  • [2] Video Polyp Segmentation: A Deep Learning Perspective
    Ji, Ge-Peng
    Xiao, Guobao
    Chou, Yu-Cheng
    Fan, Deng-Ping
    Zhao, Kai
    Chen, Geng
    Van Gool, Luc
    MACHINE INTELLIGENCE RESEARCH, 2022, 19 (06) : 531 - 549
  • [3] DEEP LEARNING FOR POLYP SEGMENTATION
    Wang, Liansheng
    Xie, Cong
    Hu, Yanxing
    GUT, 2018, 67 : A84 - A85
  • [4] Exploiting Deep Learning Techniques for Colon Polyp Segmentation
    Sierra-Sosa, Daniel
    Patino-Barrientos, Sebastian
    Garcia-Zapirain, Begonya
    Castillo-Olea, Cristian
    Elmaghraby, Adel
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (02): : 1629 - 1644
  • [5] A survey of deep learning algorithms for colorectal polyp segmentation
    Li, Sheng
    Ren, Yipei
    Yu, Yulin
    Jiang, Qianru
    He, Xiongxiong
    Li, Hongzhang
    NEUROCOMPUTING, 2025, 614
  • [6] Review of Application of Deep Learning in Colon Polyp Segmentation
    Sun, Fuyan
    Wang, Qiong
    Lyu, Zongwang
    Gong, Chunyan
    Computer Engineering and Applications, 2023, 59 (23) : 15 - 27
  • [7] Polyp detection in video colonoscopy using deep learning
    Luca, Mihaela
    Ciobanu, Adrian
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1751 - 1759
  • [8] Polyp characterization using deep learning and a publicly accessible polyp video database
    Kader, Rawen
    Cid-Mejias, Anton
    Brandao, Patrick
    Islam, Shahraz
    Hebbar, Sanjith
    Puyal, Juana Gonzalez-Bueno
    Ahmad, Omer F.
    Hussein, Mohamed
    Toth, Daniel
    Mountney, Peter
    Seward, Ed
    Vega, Roser
    Stoyanov, Danail
    Lovat, Laurence B.
    DIGESTIVE ENDOSCOPY, 2023, 35 (05) : 645 - 655
  • [9] Deep learning for video object segmentation: a review
    Mingqi Gao
    Feng Zheng
    James J. Q. Yu
    Caifeng Shan
    Guiguang Ding
    Jungong Han
    Artificial Intelligence Review, 2023, 56 : 457 - 531
  • [10] Human segmentation in surveillance video with deep learning
    Monica Gruosso
    Nicola Capece
    Ugo Erra
    Multimedia Tools and Applications, 2021, 80 : 1175 - 1199