Transformer-based multiple instance learning network with 2D positional encoding for histopathology image classification

Authors
Bin Yang [1]
Lei Ding [2]
Jianqiang Li [2]
Yong Li [2]
Guangzhi Qu [2]
Jingyi Wang [3]
Qiang Wang [2]
Bo Liu [2]
Affiliations
[1] Academy of Military Science, Center for Strategic Assessment and Consulting
[2] Beijing University of Technology, Faculty of Information Technology
[3] Oakland University, Computer Science and Engineering Department
[4] Massey University, School of Mathematical and Computational Sciences
Keywords
Weakly supervised training; Image classification; Multiple instance learning
DOI
10.1007/s40747-025-01779-y
Abstract
Digital medical imaging, particularly pathology imaging, is essential for cancer diagnosis, but the extremely high resolution of pathology images makes direct model training difficult. Although weakly supervised learning has reduced the need for manual annotations, many multiple instance learning (MIL) methods struggle to capture the spatial relationships that matter in histopathological images. Existing methods that incorporate positional information often overlook fine-grained spatial correlations or use positional encoding strategies that do not fully capture the spatial structure of pathology images. To address this issue, we propose TMIL (Transformer-based Multiple Instance Learning Network with 2D positional encoding), a framework that uses multiple instance learning for weakly supervised classification of histopathological images. TMIL incorporates a Transformer-based 2D positional encoding module to model positional information and explore correlations between instances. In addition, TMIL divides histopathological images into pseudo-bags and trains patch-level feature vectors with deep metric learning to improve classification performance. The proposed approach is evaluated on a public colorectal adenoma dataset. The experimental results show that TMIL outperforms existing MIL methods, achieving an AUC of 97.28% and an accuracy (ACC) of 95.19%. These findings suggest that TMIL’s combination of deep metric learning and positional encoding is a promising approach for improving the efficiency and accuracy of pathology image analysis in cancer diagnosis.
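The abstract names two ingredients, a 2D positional encoding over patch positions and a split of each slide into pseudo-bags, without giving their exact form. The sketch below illustrates one common way each could look, and should not be read as the authors' implementation. It assumes a fixed sinusoidal encoding (half the channels for the patch row, half for the column) and a uniformly random pseudo-bag split; the function names `sincos_1d`, `positional_encoding_2d`, and `split_into_pseudo_bags` are hypothetical.

```python
import numpy as np


def sincos_1d(positions, dim):
    """Standard 1D sinusoidal encoding of integer positions; dim must be even."""
    positions = np.asarray(positions, dtype=np.float64)
    # One frequency per (sin, cos) channel pair.
    freqs = 1.0 / (10000.0 ** (np.arange(0, dim, 2) / dim))
    angles = positions[:, None] * freqs[None, :]  # shape (n, dim // 2)
    enc = np.empty((positions.shape[0], dim))
    enc[:, 0::2] = np.sin(angles)
    enc[:, 1::2] = np.cos(angles)
    return enc


def positional_encoding_2d(rows, cols, dim):
    """Assumed 2D variant: encode row and column separately, then concatenate."""
    if dim % 4 != 0:
        raise ValueError("dim must be divisible by 4")
    half = dim // 2
    return np.concatenate([sincos_1d(rows, half), sincos_1d(cols, half)], axis=1)


def split_into_pseudo_bags(n_instances, n_pseudo_bags, seed=0):
    """Assumed strategy: randomly partition patch indices into pseudo-bags."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_instances)
    return [np.sort(chunk) for chunk in np.array_split(perm, n_pseudo_bags)]


# Four patches on a 2 x 2 grid, each with a hypothetical 8-dim feature vector.
rows = [0, 0, 1, 1]
cols = [0, 1, 0, 1]
features = np.zeros((4, 8))
tokens = features + positional_encoding_2d(rows, cols, dim=8)
pseudo_bags = split_into_pseudo_bags(n_instances=4, n_pseudo_bags=2)
```

Under these assumptions, patches at (0, 1) and (1, 0) receive different encodings, which a 1D encoding over a flattened patch sequence would not guarantee to reflect in a grid-aware way. Each pseudo-bag inherits the slide-level label during weakly supervised training.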