Adaptive aggregation with self-attention network for gastrointestinal image classification

被引:13
|
作者
Li, Sheng [1 ]
Cao, Jing [1 ]
Yao, Jiafeng [1 ]
Zhu, Jinhui [2 ]
He, Xiongxiong [1 ]
Jiang, Qianru [1 ]
机构
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou, Peoples R China
[2] Zhejiang Univ, Sch Med, Affiliated Hosp 2, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
COMPUTER-AIDED DIAGNOSIS; LESIONS;
D O I
10.1049/ipr2.12495
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic classification of diseases in endoscopic images is essential to the improvement of diagnostic performance and the reduction of colorectal cancer mortality. However, due to the ambiguous boundary between background and foreground, abnormal classification in endoscopic images is still challenging. To tackle such a situation, an adaptive aggregation with self-attention network (AASAN), including a global branch, a local branch, and a fusion branch, is proposed imitating the diagnosis process of endoscopists. On this basis, the self-attention with relative position encoding (SA-RPE) module is designed to capture long-range dependencies and gather lesion neighborhood information. Furthermore, an adaptive aggregation feature (AAF) module is proposed and embedded into the fusion branch for final image label prediction, which is helpful to capture more discriminant features. Extensive experiments show that the classification accuracy of the authors' method on Kvasir public dataset reaches 96.37% in a fivefold cross-validation, higher than the state-of-the-art deep learning algorithms.
引用
收藏
页码:2384 / 2397
页数:14
相关论文
共 50 条
  • [41] Adaptive pixel attention network for hyperspectral image classification
    Zhao, Yuefeng
    Zai, Chengmin
    Hu, Nannan
    Shi, Lu
    Zhou, Xue
    Sun, Jingqi
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [42] Deformable Self-Attention for Text Classification
    Ma, Qianli
    Yan, Jiangyue
    Lin, Zhenxi
    Yu, Liuhong
    Chen, Zipeng
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1570 - 1581
  • [43] Research on a Capsule Network Text Classification Method with a Self-Attention Mechanism
    Yu, Xiaodong
    Luo, Shun-Nain
    Wu, Yujia
    Cai, Zhufei
    Kuan, Ta-Wen
    Tseng, Shih-Pang
    SYMMETRY-BASEL, 2024, 16 (05):
  • [44] Applying Self-attention for Stance Classification
    Bugueno, Margarita
    Mendoza, Marcelo
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 51 - 61
  • [45] Biscale Convolutional Self-Attention Network for Hyperspectral Coastal Wetlands Classification
    Luo, Junshen
    He, Zhi
    Lin, Haomei
    Wu, Heqian
    IEEE Geoscience and Remote Sensing Letters, 2024, 21 : 1 - 5
  • [46] Biscale Convolutional Self-Attention Network for Hyperspectral Coastal Wetlands Classification
    Luo, Junshen
    He, Zhi
    Lin, Haomei
    Wu, Heqian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [47] Cross-Modal Self-Attention Network for Referring Image Segmentation
    Ye, Linwei
    Rochan, Mrigank
    Liu, Zhi
    Wang, Yang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10494 - 10503
  • [48] Research of Self-Attention in Image Segmentation
    Cao, Fude
    Zheng, Chunguang
    Huang, Limin
    Wang, Aihua
    Zhang, Jiong
    Zhou, Feng
    Ju, Haoxue
    Guo, Haitao
    Du, Yuxia
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [49] Improve Image Captioning by Self-attention
    Li, Zhenru
    Li, Yaoyi
    Lu, Hongtao
    NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 91 - 98
  • [50] Wavelet Frequency Division Self-Attention Transformer Image Deraining Network
    Fang, Siyan
    Liu, Bin
    Computer Engineering and Applications, 2024, 60 (06) : 259 - 273