MMNet: A Mixing Module Network for Polyp Segmentation

被引:0
|
作者
Ghimire, Raman [1 ]
Lee, Sang-Woong [2 ]
机构
[1] Gachon Univ, Dept IT Convergence Engn, Pattern Recognit & Machine Learning Lab, Seongnam 13557, South Korea
[2] Gachon Univ, Dept AI Software, Pattern Recognit & Machine Learning Lab, Seongnam 13557, South Korea
基金
新加坡国家研究基金会;
关键词
polyp segmentation; transformer; computational complexity; depth-wise and 1 x 1 convolution; mixing module;
D O I
10.3390/s23167258
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Traditional encoder-decoder networks like U-Net have been extensively used for polyp segmentation. However, such networks have demonstrated limitations in explicitly modeling long-range dependencies. In such networks, local patterns are emphasized over the global context, as each convolutional kernel focuses on only a local subset of pixels in the entire image. Several recent transformer-based networks have been shown to overcome such limitations. Such networks encode long-range dependencies using self-attention methods and thus learn highly expressive representations. However, due to the computational complexity of modeling the whole image, self-attention is expensive to compute, as there is a quadratic increment in cost with the increase in pixels in the image. Thus, patch embedding has been utilized, which groups small regions of the image into single input features. Nevertheless, these transformers still lack inductive bias, even with the image as a 1D sequence of visual tokens. This results in the inability to generalize to local contexts due to limited low-level features. We introduce a hybrid transformer combined with a convolutional mixing network to overcome computational and long-range dependency issues. A pretrained transformer network is introduced as a feature-extracting encoder, and a mixing module network (MMNet) is introduced to capture the long-range dependencies with a reduced computational cost. Precisely, in the mixing module network, we use depth-wise and 1 x 1 convolution to model long-range dependencies to establish spatial and cross-channel correlation, respectively. The proposed approach is evaluated qualitatively and quantitatively on five challenging polyp datasets across six metrics. Our MMNet outperforms the previous best polyp segmentation methods.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Shallow and reverse attention network for colon polyp segmentation
    Go-Eun Lee
    Jungchan Cho
    Sang-II Choi
    Scientific Reports, 13
  • [22] A frequency attention-embedded network for polyp segmentation
    Tang, Rui
    Zhao, Hejing
    Tong, Yao
    Mu, Ruihui
    Wang, Yuqiang
    Zhang, Shuhao
    Zhao, Yao
    Wang, Weidong
    Zhang, Min
    Liu, Yilin
    Gao, Jianbo
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [23] Learnable Oriented-Derivative Network for Polyp Segmentation
    Cheng, Mengjun
    Kong, Zishang
    Song, Guoli
    Tian, Yonghong
    Liang, Yongsheng
    Chen, Jie
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 720 - 730
  • [24] Shallow and reverse attention network for colon polyp segmentation
    Lee, Go-Eun
    Cho, Jungchan
    Choi, Sang-, II
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [25] Colorectal polyp segmentation algorithm using DoubleUNet network
    Xu C.
    Yi J.
    Cao F.
    Fang W.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (08): : 970 - 983
  • [26] Dual-Domain Learning Network for Polyp Segmentation
    Li, Yan
    Zheng, Zhuoran
    Ren, Wenqi
    Nie, Yunfeng
    Zhang, Jingang
    Jia, Xiuyi
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 233 - 247
  • [27] Polyp Segmentation via Semantic Enhanced Perceptual Network
    Wang, Tong
    Qi, Xiaoming
    Yang, Guanyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12594 - 12607
  • [28] Boundary Refinement Network for Polyp Segmentation With Deformable Attention
    Li, Zijian
    Yang, Zhiyong
    Wu, Wangsheng
    Guo, Zihang
    Zhu, Dongdong
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 121 - 125
  • [29] PPNet: Pyramid pooling based network for polyp segmentation
    Hu, Keli
    Chen, Wenping
    Sun, YuanZe
    Hu, Xiaozhao
    Zhou, Qianwei
    Zheng, Zirui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 160
  • [30] PRFNet: Progressive Region Focusing Network for Polyp Segmentation
    Chen, Jilong
    Cheng, Junlong
    Jiang, Lei
    Yin, Pengyu
    Wang, Guoan
    Zhu, Min
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V, 2024, 14429 : 394 - 406