MSHT: Multi-Stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

被引:8
|
作者
Zhang, Tianyi [1 ]
Feng, Yunlu [2 ]
Zhao, Yu [3 ]
Fan, Guangda [1 ]
Yang, Aiming [2 ]
Lyu, Shangqing [4 ]
Zhang, Peng [1 ]
Song, Fan [1 ]
Ma, Chenbin [1 ]
Sun, Yangyang [1 ]
Feng, Youdan [1 ]
Zhang, Guanglei [1 ]
机构
[1] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Sch Biol Sci & Med Engn, Beijing 100191, Peoples R China
[2] Peking Union Med Coll Hosp, Dept Gastroenterol, Beijing 100006, Peoples R China
[3] Peking Union Med Coll Hosp, Dept Pathol, Beijing 100006, Peoples R China
[4] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, Hampshire, England
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Transformers; Feature extraction; Convolutional neural networks; Pancreatic cancer; Cancer; Image analysis; Solid modeling; Cytopathology; deep learning; pancreatic cancer; rapid on-site evaluation (ROSE); Transformer; FINE-NEEDLE-ASPIRATION; EUS-FNA; DIAGNOSTIC-ACCURACY; CYTOLOGY; IMPROVE;
D O I
10.1109/JBHI.2023.3234289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pancreatic cancer is one of the most malignant cancers with high mortality. The rapid on-site evaluation (ROSE) technique can significantly accelerate the diagnostic workflow of pancreatic cancer by immediately analyzing the fast-stained cytopathological images with on-site pathologists. However, the broader expansion of ROSE diagnosis has been hindered by the shortage of experienced pathologists. Deep learning has great potential for the automatic classification of ROSE images in diagnosis. But it is challenging to model the complicated local and global image features. The traditional convolutional neural network (CNN) structure can effectively extract spatial features, while it tends to ignore global features when the prominent local features are misleading. In contrast, the Transformer structure has excellent advantages in capturing global features and long-range relations, while it has limited ability in utilizing local features. We propose a multi-stage hybrid Transformer (MSHT) to combine the strengths of both, where a CNN backbone robustly extracts multi-stage local features at different scales as the attention guidance, and a Transformer encodes them for sophisticated global modeling. Going beyond the strength of each single method, the MSHT can simultaneously enhance the Transformer global modeling ability with the local guidance from CNN features. To evaluate the method in this unexplored field, a dataset of 4240 ROSE images is collected where MSHT achieves 95.68% in classification accuracy with more accurate attention regions. The distinctively superior results compared to the state-of-the-art models make MSHT extremely promising for cytopathological image analysis.
引用
收藏
页码:1946 / 1957
页数:12
相关论文
共 50 条
  • [21] MuTr: Multi-Stage Transformer for Hand Pose Estimation from Full-Scene Depth Image
    Kanis, Jakub
    Gruber, Ivan
    Krnoul, Zdenek
    Bohacek, Matyas
    Straka, Jakub
    Hruz, Marek
    SENSORS, 2023, 23 (12)
  • [22] SAMT-generator: A second-attention for image captioning based on multi-stage transformer network
    Yang, Xiaobao
    Yang, Yang
    Ma, Sugang
    Li, Zhijie
    Dong, Wei
    Wozniak, Marcin
    NEUROCOMPUTING, 2024, 593
  • [23] THE IMPACT OF MODERNIZATION ON THE PRODUCTIVITY OF A MULTI-STAGE PRODUCTION OF TRANSFORMER SHEETS
    Michlowicz, Edward
    Smolinska, Katarzyna
    METAL 2015: 24TH INTERNATIONAL CONFERENCE ON METALLURGY AND MATERIALS, 2015, : 1720 - 1726
  • [24] Efficient multi-stage feedback attention for diverse lesion in cancer image segmentation
    Arsa, Dewa Made Sri
    Ilyas, Talha
    Park, Seok-Hwan
    Chua, Leon
    Kim, Hyongsuk
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 116
  • [25] Multi-stage models of cancer and disease
    Webster, Anthony
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2021, 50
  • [26] DACTransNet: A Hybrid CNN-Transformer Network for Histopathological Image Classification of Pancreatic Cancer
    Kou, Yongqing
    Xia, Cong
    Jiao, Yiping
    Zhang, Daoqiang
    Ge, Rongjun
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 422 - 434
  • [27] ENERGY AND EXERGY ANALYSIS OF A MULTI-STAGE HYBRID CONCENTRATED SOLAR POWER PLANT
    Moreno-Gamboa, Faustino
    Nieto-Londono, Cesar
    Escudero-Atehortua, Ana
    Lopera, Leonardo
    PROCEEDINGS OF THE ASME 2020 POWER CONFERENCE (POWER2020), 2020,
  • [28] Technology Enablers for Big Data, Multi-Stage Analysis in Medical Image Processing
    Bao, Shunxing
    Parvarthaneni, Prasanna
    Huo, Yuankai
    Barve, Yogesh
    Plassard, Andrew J.
    Yao, Yuang
    Sun, Hongyang
    Lyu, Ilwoo
    Zald, David H.
    Landman, Bennett A.
    Gokhale, Aniruddha
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 1337 - 1346
  • [29] Multi-stage optimal component analysis
    Wu, Yiming
    Liu, Xiuwen
    Mio, Washington
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2544 - +
  • [30] SI-ViT: Shuffle instance-based Vision Transformer for pancreatic cancer ROSE image classification
    Zhang, Tianyi
    Feng, Youdan
    Zhao, Yu
    Lei, Yanli
    Ying, Nan
    Song, Fan
    He, Yufang
    Yan, Zhiling
    Feng, Yunlu
    Yang, Aiming
    Zhang, Guanglei
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 244