Semantic Parsing of Colonoscopy Videos with Multi-Label Temporal Networks

被引:0
|
作者
Kelner, Ori [1 ]
Weinstein, Or [1 ]
Rivlin, Ehud [1 ]
Goldenberg, Roman [1 ]
机构
[1] Verily Life Sci, San Francisco, CA 94080 USA
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW | 2023年
关键词
D O I
10.1109/ICCVW60793.2023.00274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Following the successful debut of polyp detection and characterization, more advanced automation tools are being developed for colonoscopy. The new automation tasks, such as quality metrics or report generation, require understanding of the procedure flow that includes activities, events, anatomical landmarks, etc. In this work we present a method for automatic semantic parsing of colonoscopy videos. The method uses a novel DL multi-label temporal segmentation model trained in supervised and unsupervised regimes. We evaluate the accuracy of the method on a test set of over 300 annotated colonoscopy videos, and use ablation to explore the relative importance of various method's components.
引用
收藏
页码:2591 / 2598
页数:8
相关论文
共 50 条
  • [1] A framework for parsing colonoscopy videos for semantic units
    Cao, Y
    Tavanapong, W
    Kim, K
    Wong, J
    Oh, J
    de Groen, PC
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1879 - 1882
  • [2] Multiple Semantic Embedding with Graph Convolutional Networks for Multi-Label Image Classification
    Zhou, Tong
    Feng, Songhe
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 449 - 461
  • [3] Semantic Embedding Graph Convolutional Networks for Multi-label Video Segment Classification
    Li, Zhitao
    Wang, Jianzong
    Cheng, Ning
    Xiao, Jing
    PAAP 2021: 2021 12TH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING, 2021, : 146 - 151
  • [4] Crowdsourced Semantic Matching of Multi-Label Annotations
    Duan, Lei
    Oyama, Satoshi
    Kurihara, Masahito
    Sato, Haruhiko
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3483 - 3489
  • [5] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    Neurocomputing, 2022, 491 : 14 - 23
  • [6] Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation
    Shen, Tong
    Lin, Guosheng
    Shen, Chunhua
    Reid, Ian
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2708 - 2714
  • [7] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    NEUROCOMPUTING, 2022, 491 : 14 - 23
  • [8] Hierarchical Multi-Label Classification Networks
    Wehrmann, Jonatas
    Cerri, Ricardo
    Barros, Rodrigo C.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] Integrating Label Semantic Similarity Scores into Multi-label Text Classification
    Chen, Zihao
    Liu, Yang
    Cheng, Baitai
    Peng, Jing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 234 - 245
  • [10] Multi-label Text Classification Method Based on Label Semantic Information
    Xiao L.
    Chen B.-L.
    Huang X.
    Liu H.-F.
    Jing L.-P.
    Yu J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1079 - 1089