Cross-Scale Fusion Transformer for Histopathological Image Classification

被引:4
|
作者
Huang, Sheng-Kai [1 ]
Yu, Yu-Ting [2 ]
Huang, Chun-Rong [1 ,3 ,4 ]
Cheng, Hsiu-Chi [5 ,6 ]
机构
[1] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung 402, Taiwan
[2] Chung Shan Med Univ, Chung Shan Med Univ Hosp, Dept Pathol, Taichung 402, Taiwan
[3] Natl Cheng Kung Univ, Cross Coll Elite Program, Tainan 701, Taiwan
[4] Natl Cheng Kung Univ, Acad Innovat Semicond & Sustainable Mfg, Tainan 701, Taiwan
[5] Natl Cheng Kung Univ, Natl Cheng Kung Univ Hosp, Inst Clin Med & Mol Med, Dept Internal Med, Tainan 701, Taiwan
[6] Minist Hlth & Welf, Dept Internal Med, Tainan Hosp, Tainan 701, Taiwan
关键词
Deep learning; Correlation; Task analysis; histopathological image classification; transformer; REPRESENTATION; ENSEMBLE; FEATURES; MODEL;
D O I
10.1109/JBHI.2023.3322387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Histopathological images provide the medical evidences to help the disease diagnosis. However, pathologists are not always available or are overloaded by work. Moreover, the variations of pathological images with respect to different organs, cell sizes and magnification factors lead to the difficulty of developing a general method to solve the histopathological image classification problems. To address these issues, we propose a novel cross-scale fusion (CSF) transformer which consists of the multiple field-of-view patch embedding module, the transformer encoders and the cross-fusion modules. Based on the proposed modules, the CSF transformer can effectively integrate patch embeddings of different field-of-views to learn cross-scale contextual correlations, which represent tissues and cells of different sizes and magnification factors, with less memory usage and computation compared with the state-of-the-art transformers. To verify the generalization ability of the CSF transformer, experiments are performed on four public datasets of different organs and magnification factors. The CSF transformer outperforms the state-of-the-art task specific methods, convolutional neural network-based methods and transformer-based methods.
引用
收藏
页码:297 / 308
页数:12
相关论文
共 50 条
  • [41] 3D CROSS-SCALE FEATURE TRANSFORMER NETWORK FOR BRAIN MR IMAGE SUPER-RESOLUTION
    Zhang, Wanqi
    Wang, Lulu
    Chen, Wei
    Jia, Yuanyuan
    He, Zhongshi
    Du, Jinglong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1356 - 1360
  • [42] Progressive learning in cross-modal cross-scale fusion transformer for visible-infrared video-based person reidentification
    Mukhtar, Hamza
    Mukhtar, Umar Raza
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [43] CS-CoLBP: Cross-Scale Co-occurrence Local Binary Pattern for Image Classification
    Xiao, Bin
    Shi, Danyu
    Bi, Xiuli
    Li, Weisheng
    Gao, Xinbo
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2327 - 2344
  • [44] Cross on Cross Attention: Deep Fusion Transformer for Image Captioning
    Zhang, Jing
    Xie, Yingshuai
    Ding, Weichao
    Wang, Zhe
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 4257 - 4268
  • [45] CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
    Chen, Chun-Fu
    Fan, Quanfu
    Panda, Rameswar
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 347 - 356
  • [46] Cross-scale collaborative network for single image super resolution
    Zhou, Ying
    Zheng, Zhichao
    Sun, Quansen
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [47] The Cross-Scale Mission
    Baumjohann, W.
    Horbury, T.
    Schwartz, S.
    Canu, P.
    Louarn, P.
    Fujimoto, M.
    Nakamura, R.
    Owen, C.
    Roux, A.
    Vaivads, A.
    FUTURE PERSPECTIVES OF SPACE PLASMA AND PARTICLE INSTRUMENTATION AND INTERNATIONAL COLLABORATIONS, 2009, 1144 : 25 - +
  • [48] DUAL PATH CROSS-SCALE ATTENTION NETWORK FOR IMAGE INPAINTING
    Ni, Yuanyuan
    Cheng, Wengang
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4223 - 4227
  • [49] Blind image deconvolution via cross-scale dictionary learning
    Peng T.-Q.
    Yu J.
    Guo L.-N.
    Xiao C.-B.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2021, 29 (02): : 338 - 348
  • [50] Multiscale Fusion Transformer Network for Hyperspectral Image Classification
    Yuquan Gan
    Hao Zhang
    Chen Yi
    Journal of Beijing Institute of Technology, 2024, (03) : 255 - 270