Instance segmentation for whole slide imaging: end-to-end or detect-then-segment

被引:10
|
作者
Jha, Aadarsh [1 ]
Yang, Haichun [2 ]
Deng, Ruining [1 ]
Kapp, Meghan E. [2 ]
Fogo, Agnes B. [2 ]
Huo, Yuankai [1 ]
机构
[1] Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Med Ctr, Dept Pathol Microbiol & Immunol, Nashville, TN USA
基金
美国国家卫生研究院;
关键词
segmentation; deep learning; U-Net; Mask-RCNN; Glomeruli; whole slide imaging; DISEASE;
D O I
10.1117/1.JMI.8.1.014001
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Automatic instance segmentation of glomeruli within kidney whole slide imaging (WSI) is essential for clinical research in renal pathology. In computer vision, the end-to-end instance segmentation methods (e.g., Mask-RCNN) have shown their advantages relative to detect-then-segment approaches by performing complementary detection and segmentation tasks simultaneously. As a result, the end-to-end Mask-RCNN approach has been the de facto standard method in recent glomerular segmentation studies, where downsampling and patch-based techniques are used to properly evaluate the high-resolution images from WSI (e.g., >10; 000 x 10; 000 pixels on 40x). However, in high-resolution WSI, a single glomerulus itself can be more than 1000 x 1000 pixels in original resolution which yields significant information loss when the corresponding features maps are downsampled to the 28 x 28 resolution via the end-to-end Mask-RCNN pipeline. Approach: We assess if the end-to-end instance segmentation framework is optimal for high-resolution WSI objects by comparing Mask-RCNN with our proposed detect-then-segment framework. Beyond such a comparison, we also comprehensively evaluate the performance of our detect-then-segment pipeline through: (1) two of the most prevalent segmentation backbones (U-Net and DeepLab_v3); (2) six different image resolutions (512 x 512, 256 x 256, 128 x 128, 64 x 64, 32 x 32, and 28 x 28); and (3) two different color spaces (RGB and LAB). Results: Our detect-then-segment pipeline, with the DeepLab_v3 segmentation framework operating on previously detected glomeruli of 512 x 512 resolution, achieved a 0.953 Dice similarity coefficient (DSC), compared with a 0.902 DSC from the end-to-end Mask-RCNN pipeline. Further, we found that neither RGB nor LAB color spaces yield better performance when compared against each other in the context of a detect-then-segment framework. Conclusions: The detect-then-segment pipeline achieved better segmentation performance compared with the end-to-end method. Our study provides an extensive quantitative reference for other researchers to select the optimized and most accurate segmentation approach for glomeruli, or other biological objects of similar character, on high-resolution WSI. (C) 2021 Society of Photo-Optical Instrumentation Engineers (SPIE)
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Multiresolution Imaging: An End-to-End Assessment
    Rachel Alter-Gartenberg
    Journal of Mathematical Imaging and Vision, 1998, 8 : 59 - 77
  • [32] Multiresolution imaging: An end-to-end assessment
    Alter-Gartenberg, R
    JOURNAL OF MATHEMATICAL IMAGING AND VISION, 1998, 8 (01) : 59 - 77
  • [33] An end-to-end recurrent compressed sensing method to denoise, detect and demix calcium imaging data
    Zhang, Kangning
    Tang, Sean
    Zhu, Vivian
    Barchini, Majd
    Yang, Weijian
    NATURE MACHINE INTELLIGENCE, 2024, 6 (09) : 1106 - 1118
  • [34] End-to-End Instance-Level Human Parsing by Segmenting Persons
    Li, Zhuang
    Cao, Leilei
    Wang, Hongbin
    Xu, Lihong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 41 - 50
  • [35] Improved Instance Discrimination and Feature Compactness for End-to-End Person Search
    Hou, Shaowei
    Zhao, Cairong
    Chen, Zhicheng
    Wu, Jun
    Wei, Zhihua
    Miao, Duoqian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2079 - 2090
  • [36] Multi-Instance Aware Localization for End-to-End Imitation Learning
    Venkatesh, Sagar Gubbi
    Upadrashta, Raviteja
    Kolathaya, Shishir
    Amrutur, Bharadwaj
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5225 - 5230
  • [37] Evaluating Subtitle Segmentation for End-to-end Generation Systems
    Karakanta, Alina
    Buet, Franc
    Cettolo, Mauro
    Yvon, Francois
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3069 - 3078
  • [38] An end-to-end generative framework for video segmentation and recognition
    Kuehne, Hilde
    Gall, Juergen
    Serre, Thomas
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [39] End-to-End Segmentation-based News Summarization
    Liu, Yang
    Zhu, Chenguang
    Zeng, Michael
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 544 - 554
  • [40] Segmentation mask guided end-to-end person search
    Zheng, Dingyuan
    Xiao, Jimin
    Huang, Kaizhu
    Zhao, Yao
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 86