MSHT: Multi-Stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

被引:8
|
作者
Zhang, Tianyi [1 ]
Feng, Yunlu [2 ]
Zhao, Yu [3 ]
Fan, Guangda [1 ]
Yang, Aiming [2 ]
Lyu, Shangqing [4 ]
Zhang, Peng [1 ]
Song, Fan [1 ]
Ma, Chenbin [1 ]
Sun, Yangyang [1 ]
Feng, Youdan [1 ]
Zhang, Guanglei [1 ]
机构
[1] Beihang Univ, Beijing Adv Innovat Ctr Biomed Engn, Sch Biol Sci & Med Engn, Beijing 100191, Peoples R China
[2] Peking Union Med Coll Hosp, Dept Gastroenterol, Beijing 100006, Peoples R China
[3] Peking Union Med Coll Hosp, Dept Pathol, Beijing 100006, Peoples R China
[4] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, Hampshire, England
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Transformers; Feature extraction; Convolutional neural networks; Pancreatic cancer; Cancer; Image analysis; Solid modeling; Cytopathology; deep learning; pancreatic cancer; rapid on-site evaluation (ROSE); Transformer; FINE-NEEDLE-ASPIRATION; EUS-FNA; DIAGNOSTIC-ACCURACY; CYTOLOGY; IMPROVE;
D O I
10.1109/JBHI.2023.3234289
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pancreatic cancer is one of the most malignant cancers with high mortality. The rapid on-site evaluation (ROSE) technique can significantly accelerate the diagnostic workflow of pancreatic cancer by immediately analyzing the fast-stained cytopathological images with on-site pathologists. However, the broader expansion of ROSE diagnosis has been hindered by the shortage of experienced pathologists. Deep learning has great potential for the automatic classification of ROSE images in diagnosis. But it is challenging to model the complicated local and global image features. The traditional convolutional neural network (CNN) structure can effectively extract spatial features, while it tends to ignore global features when the prominent local features are misleading. In contrast, the Transformer structure has excellent advantages in capturing global features and long-range relations, while it has limited ability in utilizing local features. We propose a multi-stage hybrid Transformer (MSHT) to combine the strengths of both, where a CNN backbone robustly extracts multi-stage local features at different scales as the attention guidance, and a Transformer encodes them for sophisticated global modeling. Going beyond the strength of each single method, the MSHT can simultaneously enhance the Transformer global modeling ability with the local guidance from CNN features. To evaluate the method in this unexplored field, a dataset of 4240 ROSE images is collected where MSHT achieves 95.68% in classification accuracy with more accurate attention regions. The distinctively superior results compared to the state-of-the-art models make MSHT extremely promising for cytopathological image analysis.
引用
收藏
页码:1946 / 1957
页数:12
相关论文
共 50 条
  • [41] An iterative multi-stage MR image correction method
    Jia, Luzhi
    Li, Zhaohui
    Ling, Qiang
    Li, Feng
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3996 - 4000
  • [42] Motion Deblurring by Fusing Multi-Stage Image Features
    Zhang, Shihua
    He, Fan
    Shao, Xun
    2024 IEEE CYBER SCIENCE AND TECHNOLOGY CONGRESS, CYBERSCITECH 2024, 2024, : 168 - 173
  • [43] A Multi-Stage Encryption Technique to Enhance the Secrecy of Image
    Mondal, Arindom
    Alain, Kazi Md Rokibul
    Ali, G. G. Md Nawaz
    Chong, Peter Han Joo
    Morimoto, Yasuhiko
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (05): : 2698 - 2717
  • [44] FINGERPRINT IMAGE DEPURATION BY MULTI-STAGE COMPUTATIONAL METHOD
    Babatunde, Iwasokun Gabriel
    Charles, Akinyokun Oluwole
    Kayode, Alese Boniface
    Olatubosun, Olabode
    IAENG TRANSACTIONS ON ELECTRICAL ENGINEERING, VOL 1, 2012, : 271 - 287
  • [45] Generative Image Inpainting with Multi-Stage Decoding Network
    Liu W.-R.
    Mi Y.-C.
    Yang F.
    Zhang Y.
    Guo H.-L.
    Liu Z.-M.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2022, 50 (03): : 625 - 636
  • [46] Multi-stage Aggregated Transformer Network for Temporal Language Localization in Videos
    Zhang, Mingxing
    Yang, Yang
    Chen, Xinghan
    Ji, Yanli
    Xu, Xing
    Li, Jingjing
    Shen, Heng Tao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12664 - 12673
  • [47] MMPDNet: Multi-Stage & Multi-Attention Progressive Image Denoising
    Xue, Jiangbo
    Liang, Jiu
    Zhang, Yu
    He, Jinhe
    Hu, Yanda
    20TH INT CONF ON UBIQUITOUS COMP AND COMMUNICAT (IUCC) / 20TH INT CONF ON COMP AND INFORMATION TECHNOLOGY (CIT) / 4TH INT CONF ON DATA SCIENCE AND COMPUTATIONAL INTELLIGENCE (DSCI) / 11TH INT CONF ON SMART COMPUTING, NETWORKING, AND SERV (SMARTCNS), 2021, : 467 - 473
  • [48] FORMS OF APPLICATION OF CANCER MULTI-STAGE THERAPY
    ARDENNE, MV
    DEUTSCHE GESUNDHEITSWESEN-ZEITSCHRIFT FUR KLINISCHE MEDIZIN, 1974, 29 (18): : 835 - 844
  • [49] Multi-stage vectors for personalized cancer therapy
    Shen, Haifa
    Liu, Xuewu
    Ferrari, Mauro
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 248
  • [50] Concept and analysis of hybrid reversal multi-stage flash and membrane distillation desalination system
    Ali, Emad
    Orfi, Jamel
    Alansary, Hany
    Baakeem, Saleh
    Alsaadi, Ahmad S.
    Ghaffour, Noreddine
    ENVIRONMENTAL TECHNOLOGY, 2024, 45 (24) : 5218 - 5231