MSHT: Multi-Stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

被引:8
|
作者
Zhang, Tianyi [1 ]
Feng, Yunlu [2 ]
Zhao, Yu [3 ]
Fan, Guangda [1 ]
Yang, Aiming [2 ]
Lyu, Shangqing [4 ]
Zhang, Peng [1 ]
Song, Fan [1 ]
Ma, Chenbin [1 ]
Sun, Yangyang [1 ]
Feng, Youdan [1 ]
Zhang, Guanglei [1 ]
机构
[1] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Sch Biol Sci & Med Engn, Beijing 100191, Peoples R China
[2] Peking Union Med Coll Hosp, Dept Gastroenterol, Beijing 100006, Peoples R China
[3] Peking Union Med Coll Hosp, Dept Pathol, Beijing 100006, Peoples R China
[4] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, Hampshire, England
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Transformers; Feature extraction; Convolutional neural networks; Pancreatic cancer; Cancer; Image analysis; Solid modeling; Cytopathology; deep learning; pancreatic cancer; rapid on-site evaluation (ROSE); Transformer; FINE-NEEDLE-ASPIRATION; EUS-FNA; DIAGNOSTIC-ACCURACY; CYTOLOGY; IMPROVE;
D O I
10.1109/JBHI.2023.3234289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pancreatic cancer is one of the most malignant cancers with high mortality. The rapid on-site evaluation (ROSE) technique can significantly accelerate the diagnostic workflow of pancreatic cancer by immediately analyzing the fast-stained cytopathological images with on-site pathologists. However, the broader expansion of ROSE diagnosis has been hindered by the shortage of experienced pathologists. Deep learning has great potential for the automatic classification of ROSE images in diagnosis. But it is challenging to model the complicated local and global image features. The traditional convolutional neural network (CNN) structure can effectively extract spatial features, while it tends to ignore global features when the prominent local features are misleading. In contrast, the Transformer structure has excellent advantages in capturing global features and long-range relations, while it has limited ability in utilizing local features. We propose a multi-stage hybrid Transformer (MSHT) to combine the strengths of both, where a CNN backbone robustly extracts multi-stage local features at different scales as the attention guidance, and a Transformer encodes them for sophisticated global modeling. Going beyond the strength of each single method, the MSHT can simultaneously enhance the Transformer global modeling ability with the local guidance from CNN features. To evaluate the method in this unexplored field, a dataset of 4240 ROSE images is collected where MSHT achieves 95.68% in classification accuracy with more accurate attention regions. The distinctively superior results compared to the state-of-the-art models make MSHT extremely promising for cytopathological image analysis.
引用
收藏
页码:1946 / 1957
页数:12
相关论文
共 50 条
  • [31] Analysis and modelling of multi-stage attacks
    Chamotra, Saurabh
    Barbhuiya, Ferdous Ahmed
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 1269 - 1276
  • [32] Remotely sensed image processing with multi-stage inferences
    Yamamoto, H
    Homma, K
    Isobe, T
    Naka, M
    Matsumura, S
    Tameishi, H
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING V, 1999, 3871 : 337 - 342
  • [33] A multi-stage chaotic encryption technique for medical image
    Tanveer, Md. Siddiqur Rahman
    Md. Rokibul Alam, Kazi
    Morimoto, Yasuhiko
    INFORMATION SECURITY JOURNAL, 2022, 31 (06): : 657 - 675
  • [34] An improved multi-stage vector quantization for image coding
    Wang, Meng
    Ma, Hui-Ping
    Zhou, Chong-Qing
    Yang, Bian
    2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2007, : 415 - +
  • [35] Multi-Stage DC Hybrid Switch with Slow Switching
    Tang, Y.
    Duarte, J. L.
    Smeets, R. P. P.
    Motoasca, T. E.
    Lomonova, E. A.
    IECON 2011: 37TH ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2011, : 1462 - 1467
  • [36] Swin-Caption: Swin Transformer-Based Image Captioning with Feature Enhancement and Multi-Stage Fusion
    Liu, Lei
    Jiao, Yidi
    Li, Xiaoran
    Li, Jing
    Wang, Haitao
    Cao, Xinyu
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2024,
  • [37] MSM: Multi-stage Multicuts for Scalable Image Clustering
    Ho, Kalun
    Chatzimichailidis, Avraam
    Keuper, Margret
    Keuper, Janis
    HIGH PERFORMANCE COMPUTING - ISC HIGH PERFORMANCE DIGITAL 2021 INTERNATIONAL WORKSHOPS, 2021, 12761 : 267 - 284
  • [38] Multi-stage filtering for single rainy image enhancement
    Shi, Zhenghao
    Li, Yaowei
    Zhao, Minghua
    Feng, Yaning
    He, Lifeng
    IET IMAGE PROCESSING, 2018, 12 (10) : 1866 - 1872
  • [39] IMAGE DENOISING USING MULTI-STAGE SPARSE REPRESENTATIONS
    Gan, Tao
    Lu, Wenmiao
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1165 - 1168
  • [40] MSPNet: Multi-stage progressive network for image denoising
    Bai, Yu
    Liu, Meiqin
    Yao, Chao
    Lin, Chunyu
    Zhao, Yao
    NEUROCOMPUTING, 2023, 517 : 71 - 80