Cross-Scale Fusion Transformer for Histopathological Image Classification

被引：4

作者：

Huang, Sheng-Kai ^{[1
]}

Yu, Yu-Ting ^{[2
]}

Huang, Chun-Rong ^{[1
,3
,4
]}

Cheng, Hsiu-Chi ^{[5
,6
]}

机构：

[1] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung 402, Taiwan

[2] Chung Shan Med Univ, Chung Shan Med Univ Hosp, Dept Pathol, Taichung 402, Taiwan

[3] Natl Cheng Kung Univ, Cross Coll Elite Program, Tainan 701, Taiwan

[4] Natl Cheng Kung Univ, Acad Innovat Semicond & Sustainable Mfg, Tainan 701, Taiwan

[5] Natl Cheng Kung Univ, Natl Cheng Kung Univ Hosp, Inst Clin Med & Mol Med, Dept Internal Med, Tainan 701, Taiwan

[6] Minist Hlth & Welf, Dept Internal Med, Tainan Hosp, Tainan 701, Taiwan

来源：

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS | 2024年 / 28卷 / 01期

关键词：

Deep learning; Correlation; Task analysis; histopathological image classification; transformer; REPRESENTATION; ENSEMBLE; FEATURES; MODEL;

D O I：

10.1109/JBHI.2023.3322387

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Histopathological images provide the medical evidences to help the disease diagnosis. However, pathologists are not always available or are overloaded by work. Moreover, the variations of pathological images with respect to different organs, cell sizes and magnification factors lead to the difficulty of developing a general method to solve the histopathological image classification problems. To address these issues, we propose a novel cross-scale fusion (CSF) transformer which consists of the multiple field-of-view patch embedding module, the transformer encoders and the cross-fusion modules. Based on the proposed modules, the CSF transformer can effectively integrate patch embeddings of different field-of-views to learn cross-scale contextual correlations, which represent tissues and cells of different sizes and magnification factors, with less memory usage and computation compared with the state-of-the-art transformers. To verify the generalization ability of the CSF transformer, experiments are performed on four public datasets of different organs and magnification factors. The CSF transformer outperforms the state-of-the-art task specific methods, convolutional neural network-based methods and transformer-based methods.

引用

页码：297 / 308

页数：12

共 50 条

[21] Spatially Separable Attention Transformer with Cross-Scale Encoding for Remote Sensing Image Road Extraction
Tian, Qing
Zhang, Yao
Zhang, Zheng
Lyu, Qixiu
Computer Engineering and Applications, 60 (23): : 219 - 228
[22] CASF-Net: Cross-attention and cross-scale fusion network for medical image segmentation
Zheng, Jianwei
Liu, Hao
Feng, Yuchao
Xu, Jinshan
Zhao, Liang
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 229
[23] Improving Polyp Segmentation with Boundary-Assisted Guidance and Cross-Scale Interaction Fusion Transformer Network
Jiang, Lincen
Hui, Yan
Fei, Yuan
Ji, Yimu
Zeng, Tao
PROCESSES, 2024, 12 (05)
[24] The use of KPCA over subspaces for cross-scale superpixel based hyperspectral image classification
Yu, Haoyang
Xu, Zhen
Wang, Yulei
Jiao, Tong
Guo, Qiandong
REMOTE SENSING LETTERS, 2021, 12 (05) : 470 - 477
[25] Cross-scale cascade transformer for multimodal human action recognition
Liu, Zhen
Cheng, Qin
Song, Chengqun
Cheng, Jun
PATTERN RECOGNITION LETTERS, 2023, 168 : 17 - 23
[26] CROSSFORMER: A VERSATILE VISION TRANSFORMER HINGING ON CROSS-SCALE ATTENTION
Wang, Wenxiao
Yao, Lu
Chen, Long
Lin, Binbin
Cai, Deng
He, Xiaofei
Liu, Wei
ICLR 2022 - 10th International Conference on Learning Representations, 2022,
[27] Cross-scale feature fusion connection for a YOLO detector
Ruan, Zhongling
Wang, Hao
Cao, Jianzhong
Zhang, Hongbo
IET COMPUTER VISION, 2022, 16 (02) : 99 - 110
[28] Transformer based on multi-scale local feature for colon cancer histopathological image classification
Fu, Zhibing
Chen, Qingkui
Wang, Mingming
Huang, Chen
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
[29] Cross-scale feature fusion-based JND estimation for robust image in DWT domain
Wang, Chunxing
Li, Shang
Liu, Yan
Meng, Lili
Zhang, Kai
Wan, Wenbo
OPTIK, 2023, 272
[30] CSPA-GAN: A Cross-Scale Pyramid Attention GAN for Infrared and Visible Image Fusion
Yin, Haitao
Xiao, Jinghu
Chen, Hao
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72

← 1 2 3 4 5 →