Semantic Parsing of Colonoscopy Videos with Multi-Label Temporal Networks

Cited by: 0

Authors:

Kelner, Ori [1]

Weinstein, Or [1]

Rivlin, Ehud [1]

Goldenberg, Roman [1]

Affiliations:

[1] Verily Life Sci, San Francisco, CA 94080 USA

Source:

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW | 2023

Keywords:

DOI:

10.1109/ICCVW60793.2023.00274

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Following the successful debut of polyp detection and characterization, more advanced automation tools are being developed for colonoscopy. New automation tasks, such as quality metrics or report generation, require an understanding of the procedure flow, which includes activities, events, anatomical landmarks, etc. In this work we present a method for automatic semantic parsing of colonoscopy videos. The method uses a novel deep learning (DL) multi-label temporal segmentation model trained in supervised and unsupervised regimes. We evaluate the accuracy of the method on a test set of over 300 annotated colonoscopy videos, and use ablation to explore the relative importance of the method's various components.


Pages: 2591 - 2598

Number of pages: 8

50 items in total

[1] A framework for parsing colonoscopy videos for semantic units
Cao, Y
Tavanapong, W
Kim, K
Wong, J
Oh, J
de Groen, PC
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1-3, 2004, : 1879 - 1882
[2] Multiple Semantic Embedding with Graph Convolutional Networks for Multi-Label Image Classification
Zhou, Tong
Feng, Songhe
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 449 - 461
[3] Semantic Embedding Graph Convolutional Networks for Multi-label Video Segment Classification
Li, Zhitao
Wang, Jianzong
Cheng, Ning
Xiao, Jing
PAAP 2021: 2021 12TH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING, 2021, : 146 - 151
[4] Crowdsourced Semantic Matching of Multi-Label Annotations
Duan, Lei
Oyama, Satoshi
Kurihara, Masahito
Sato, Haruhiko
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3483 - 3489
[5] A multi-scale semantic attention representation for multi-label image recognition with graph networks
Liang, Jun
Xu, Feiteng
Yu, Songsen
Neurocomputing, 2022, 491 : 14 - 23
[6] Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation
Shen, Tong
Lin, Guosheng
Shen, Chunhua
Reid, Ian
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2708 - 2714
[7] Hierarchical Multi-Label Classification Networks
Wehrmann, Jonatas
Cerri, Ricardo
Barros, Rodrigo C.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[8] Integrating Label Semantic Similarity Scores into Multi-label Text Classification
Chen, Zihao
Liu, Yang
Cheng, Baitai
Peng, Jing
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 234 - 245
[9] Multi-label Text Classification Method Based on Label Semantic Information
Xiao L.
Chen B.-L.
Huang X.
Liu H.-F.
Jing L.-P.
Yu J.
Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): 1079 - 1089
