An automated detection system for colonoscopy images using a dual encoder-decoder model

被引:7
|
作者
Hwang, Maxwell [1 ,2 ,3 ]
Wang, Da [1 ,2 ,3 ]
Kong, Xiang-Xing [1 ,2 ,3 ]
Wang, Zhanhuai [1 ,2 ,3 ]
Li, Jun [1 ,2 ,3 ]
Jiang, Wei-Cheng [4 ]
Hwang, Kao-Shing [5 ]
Ding, Kefeng [1 ,2 ,3 ]
机构
[1] Zhejiang Univ, Dept Colorectal Surg, Affiliated Hosp 2, Sch Med, Hangzhou, Peoples R China
[2] China Natl Minist Educ, Key Lab Mol Biol Med Sci, Key Lab Canc Prevent & Intervent, Canc Inst, Hangzhou, Zhejiang, Peoples R China
[3] Zhejiang Univ, Sch Med, Affiliated Hosp 2, Hangzhou, Peoples R China
[4] Tunghai Univ, Dept Elect Engn, Taichung, Taiwan
[5] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Colorectal cancer; Computer-aided detection; Deep learning; Polyp detection; Convolutional neural network; POLYPS;
D O I
10.1016/j.compmedimag.2020.101763
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Conventional computer-aided detection systems (CADs) for colonoscopic images utilize shape, texture, or temporal information to detect polyps, so they have limited sensitivity and specificity. This study proposes a method to extract possible polyp features automatically using convolutional neural networks (CNNs). The objective of this work aims at building up a light-weight dual encoder-decoder model structure for polyp detection in colonoscopy Images. This proposed model, though with a relatively shallow structure, is expected to have the capability of a similar performance to the methods with much deeper structures. The proposed CAD model consists of two sequential encoder-decoder networks that consist of several CNN layers and full connection layers. The front end of the model is a hetero-associator (also known as hetero-encoder) that uses backpropagation learning to generate a set of reliably corrupted labeled images with a certain degree of similarity to a ground truth image, which eliminates the need for a large amount of training data that is usually required for medical images tasks. This dual CNN architecture generates a set of noisy images that are similar to the labeled data to train its counterpart, the auto-associator (also known as auto-encoder), in order to increase the successor's discriminative power in classification. The auto-encoder is also equipped with CNNs to simultaneously capture the features of the labeled images that contain noise. The proposed method uses features that are learned from open medical datasets and the dataset of Zhejiang University (ZJU), which contains around one thousand images. The performance of the proposed architecture is compared with a state-of-the-art detection model in terms of the metrics of the Jaccard index, the DICE similarity score, and two other geometric measures. The improvements in the performance of the proposed model are attributed to the effective reduction in false positives in the auto-encoder and the generation of noisy candidate images by the hetero-encoder. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Dual Encoder-Decoder U-Net Architecture for Polyp Segmentation in Colonoscopy Images with Shuffle Attention and Conditional Random Fields
    Lijin, P.
    Santhosh, Kumar G.
    Nair, Madhu S.
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [22] VisCode: Embedding Information in Visualization Images using Encoder-Decoder Network
    Zhang, Peiying
    Li, Chenhui
    Wang, Changbo
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 326 - 336
  • [23] Semantic pixel labelling in remote sensing images using a deep convolutional encoder-decoder model
    Wei, Xin
    Fu, Kun
    Gao, Xin
    Yan, Menglong
    Sun, Xian
    Chen, Kaiqiang
    Sun, Hao
    REMOTE SENSING LETTERS, 2018, 9 (03) : 199 - 208
  • [24] Deep Architecture Based Spalling Severity Detection System Using Encoder-Decoder Networks
    Yasmin, Tamanna
    Le, Chuong
    Hung Manh La
    ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 332 - 343
  • [25] A Similarity Searching System for Biological Phenotype images Using Deep Convolutional Encoder-decoder Architecture
    Wu, Bizhi
    Zhang, Hanxiao
    Lin, Limei
    Wang, Huiyuan
    Gao, Yubang
    Zhao, Liangzhen
    Chen, Yi-Ping Phoebe
    Chen, Riqing
    Gu, Lianfeng
    CURRENT BIOINFORMATICS, 2019, 14 (07) : 628 - 639
  • [26] Encoder-decoder multimodal speaker change detection
    Jung, Jee-weon
    Seo, Soonshin
    Heo, Hee-Soo
    Kim, Geonmin
    Kim, You Jin
    Kwon, Young-ki
    Lee, Minjae
    Lee, Bong-Jin
    INTERSPEECH 2023, 2023, : 5311 - 5315
  • [27] Filling gaps of cartographic polylines by using an encoder-decoder model
    Yu, Wenhao
    Chen, Yujie
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2022, 36 (11) : 2296 - 2321
  • [28] Learning Depth for Scene Reconstruction Using an Encoder-Decoder Model
    Tu, Xiaohan
    Xu, Cheng
    Liu, Siping
    Xie, Guoqi
    Huang, Jing
    Li, Renfa
    Yuan, Junsong
    IEEE ACCESS, 2020, 8 : 89300 - 89317
  • [29] Automated Maxillofacial Segmentation in Panoramic Dental X-Ray Images Using an Efficient Encoder-Decoder Network
    Kong, Zhengmin
    Xiong, Feng
    Zhang, Chenggang
    Fu, Zhuolin
    Zhang, Maoqi
    Weng, Jingxin
    Fan, Mingzhe
    IEEE ACCESS, 2020, 8 : 207822 - 207833
  • [30] Transformer-based Encoder-Decoder Model for Surface Defect Detection
    Lu, Xiaofeng
    Fan, Wentao
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 125 - 130