Cross-Scale Fusion Transformer for Histopathological Image Classification

Cited by: 4
Authors
Huang, Sheng-Kai [1]
Yu, Yu-Ting [2]
Huang, Chun-Rong [1, 3, 4]
Cheng, Hsiu-Chi [5, 6]
Affiliations
[1] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung 402, Taiwan
[2] Chung Shan Med Univ, Chung Shan Med Univ Hosp, Dept Pathol, Taichung 402, Taiwan
[3] Natl Cheng Kung Univ, Cross Coll Elite Program, Tainan 701, Taiwan
[4] Natl Cheng Kung Univ, Acad Innovat Semicond & Sustainable Mfg, Tainan 701, Taiwan
[5] Natl Cheng Kung Univ, Natl Cheng Kung Univ Hosp, Inst Clin Med & Mol Med, Dept Internal Med, Tainan 701, Taiwan
[6] Minist Hlth & Welf, Dept Internal Med, Tainan Hosp, Tainan 701, Taiwan
Keywords
Deep learning; Correlation; Task analysis; histopathological image classification; transformer; REPRESENTATION; ENSEMBLE; FEATURES; MODEL
DOI
10.1109/JBHI.2023.3322387
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Histopathological images provide medical evidence that supports disease diagnosis. However, pathologists are not always available or are overloaded with work. Moreover, pathological images vary across organs, cell sizes and magnification factors, which makes it difficult to develop a general method for histopathological image classification. To address these issues, we propose a novel cross-scale fusion (CSF) transformer, which consists of a multiple field-of-view patch embedding module, transformer encoders and cross-fusion modules. With these modules, the CSF transformer effectively integrates patch embeddings of different fields of view to learn cross-scale contextual correlations, which represent tissues and cells of different sizes and magnification factors, while using less memory and computation than state-of-the-art transformers. To verify the generalization ability of the CSF transformer, experiments are performed on four public datasets covering different organs and magnification factors. The CSF transformer outperforms state-of-the-art task-specific methods, convolutional neural network-based methods and transformer-based methods.
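The idea of embedding patches at more than one field of view can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not taken from the paper: the function names `extract_patches`, `embed` and `pair_scales` are hypothetical, the "embedding" is a plain mean instead of a learned projection, and the pairing step is a naive stand-in, not the CSF transformer's cross-fusion module.

```python
def extract_patches(image, patch_size):
    """Split a square 2D image (list of rows) into non-overlapping, flattened patches."""
    size = len(image)
    patches = []
    for top in range(0, size - patch_size + 1, patch_size):
        for left in range(0, size - patch_size + 1, patch_size):
            patches.append([image[top + r][left + c]
                            for r in range(patch_size)
                            for c in range(patch_size)])
    return patches


def embed(patches):
    """Toy embedding (assumption): one scalar per patch, the mean pixel value."""
    return [sum(p) / len(p) for p in patches]


def pair_scales(fine, coarse, fine_grid, coarse_grid):
    """Naive pairing (assumption): attach to each fine token the coarse token covering it."""
    ratio = fine_grid // coarse_grid
    fused = []
    for i in range(fine_grid):
        for j in range(fine_grid):
            parent = (i // ratio) * coarse_grid + (j // ratio)
            fused.append((fine[i * fine_grid + j], coarse[parent]))
    return fused


# 8x8 toy image whose pixel value encodes its position.
image = [[row * 8 + col for col in range(8)] for row in range(8)]

fine_tokens = embed(extract_patches(image, 2))    # small field of view: 4x4 grid
coarse_tokens = embed(extract_patches(image, 4))  # large field of view: 2x2 grid
fused_tokens = pair_scales(fine_tokens, coarse_tokens, 4, 2)

print(len(fine_tokens), len(coarse_tokens), len(fused_tokens))  # prints: 16 4 16
```

The larger patch size yields far fewer tokens (4 versus 16 here), which gives some intuition for why mixing fields of view can reduce the number of tokens an encoder has to process; the actual memory and computation savings reported in the abstract depend on the paper's own design.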
Pages: 297 - 308
Number of pages: 12
Related papers
50 in total
  • [21] Spatially Separable Attention Transformer with Cross-Scale Encoding for Remote Sensing Image Road Extraction
    Tian, Qing
    Zhang, Yao
    Zhang, Zheng
    Lyu, Qixiu
    Computer Engineering and Applications, 60 (23) : 219 - 228
  • [22] CASF-Net: Cross-attention and cross-scale fusion network for medical image segmentation
    Zheng, Jianwei
    Liu, Hao
    Feng, Yuchao
    Xu, Jinshan
    Zhao, Liang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 229
  • [23] Improving Polyp Segmentation with Boundary-Assisted Guidance and Cross-Scale Interaction Fusion Transformer Network
    Jiang, Lincen
    Hui, Yan
    Fei, Yuan
    Ji, Yimu
    Zeng, Tao
    PROCESSES, 2024, 12 (05)
  • [24] The use of KPCA over subspaces for cross-scale superpixel based hyperspectral image classification
    Yu, Haoyang
    Xu, Zhen
    Wang, Yulei
    Jiao, Tong
    Guo, Qiandong
    REMOTE SENSING LETTERS, 2021, 12 (05) : 470 - 477
  • [25] Cross-scale cascade transformer for multimodal human action recognition
    Liu, Zhen
    Cheng, Qin
    Song, Chengqun
    Cheng, Jun
    PATTERN RECOGNITION LETTERS, 2023, 168 : 17 - 23
  • [26] CROSSFORMER: A VERSATILE VISION TRANSFORMER HINGING ON CROSS-SCALE ATTENTION
    Wang, Wenxiao
    Yao, Lu
    Chen, Long
    Lin, Binbin
    Cai, Deng
    He, Xiaofei
    Liu, Wei
    ICLR 2022 - 10th International Conference on Learning Representations, 2022
  • [27] Cross-scale feature fusion connection for a YOLO detector
    Ruan, Zhongling
    Wang, Hao
    Cao, Jianzhong
    Zhang, Hongbo
    IET COMPUTER VISION, 2022, 16 (02) : 99 - 110
  • [28] Transformer based on multi-scale local feature for colon cancer histopathological image classification
    Fu, Zhibing
    Chen, Qingkui
    Wang, Mingming
    Huang, Chen
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [29] Cross-scale feature fusion-based JND estimation for robust image in DWT domain
    Wang, Chunxing
    Li, Shang
    Liu, Yan
    Meng, Lili
    Zhang, Kai
    Wan, Wenbo
    OPTIK, 2023, 272
  • [30] CSPA-GAN: A Cross-Scale Pyramid Attention GAN for Infrared and Visible Image Fusion
    Yin, Haitao
    Xiao, Jinghu
    Chen, Hao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72