An attention-guided multi-scale fusion network for surgical instrument segmentation

被引:0
|
作者
Song, Mengqiu [1 ]
Zhai, Chenxu [1 ]
Yang, Lei [1 ]
Liu, Yanhong [1 ]
Bian, Guibin [2 ]
机构
[1] Zhengzhou Univ, Sch Elect & Informat Engn, Zhengzhou 450001, Henan, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
关键词
Surgical instrument segmentation; Dual attention fusion; Context feature fusion; Adaptive multi-scale feature fusion;
D O I
10.1016/j.bspc.2024.107296
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In contemporary surgical practice, minimally invasive surgery has significantly alleviated the physiological and psychological strain on patients while dramatically curtailing their recovery periods. Within the realm of robot-assisted minimally invasive surgery, the precise segmentation of surgical instruments assumes paramount importance, as it not only enhances the precision with which surgeons execute surgical maneuvers but also fortifies the overall perioperative safety of patients. Despite these benefits, the accurate segmentation of surgical instruments remains beset by a multitude of challenges, emanating primarily from the intricacy of the surgical milieu, specular reflection, diverse instruments, etc. To efficaciously confront these challenges, this paper introduces a novel attention-guided multi-scale fusion network. Specifically, to facilitate effective feature representation, an effective backbone network leveraging Octave convolution is constructed to mitigate feature redundancy. Simultaneously, the encoding path incorporates the Transformer module into bottleneck layer to infuse global contextual information, thereby synergistically capturing both global and local feature information. Moreover, a dual attention fusion block and a context feature fusion block are ingeniously integrated into the skip connections to refine local features, to meticulously discern edge details and effectively suppress the interference of useless information. Lastly, this paper presents an adaptive multi-Scale feature weighting block, which adeptly fuses multi-scale features from disparate layers within the decoding path. To rigorously substantiate the performance of proposed model, comprehensive experimentation is conducted on two widely recognized benchmark datasets. The results reach a Dice score of 96.34% and a mIOU value of 96.14% on kvasir-instrument dataset. Meanwhile, it also reaches a Dice score of 97.31% and a mIOU value of 96.15% on Endovis2017 dataset. Experiments show that it attests to the substantial superiority of proposed network in terms of accuracy and robustness against with advanced segmentation models. Therefore, proposed model could offer a promising solution to enhance the precision and safety of robot-assisted minimally invasive surgeries.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Multiscale Attention-Guided Panoptic Segmentation Network
    Fu, Du
    Qu, Shaojun
    Fu, Ya
    Computer Engineering and Applications, 2023, 59 (22) : 223 - 232
  • [42] Multi-scale Spatial-Spectral Attention Guided Fusion Network for Pansharpening
    Yang, Yong
    Li, Mengzhen
    Huang, Shuying
    Lu, Hangyuan
    Tu, Wei
    Wan, Weiguo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3346 - 3354
  • [43] Attention-Guided Network for Semantic Video Segmentation
    Li, Jiangyun
    Zhao, Yikai
    Fu, Jun
    Wu, Jiajia
    Liu, Jing
    IEEE ACCESS, 2019, 7 : 140680 - 140689
  • [44] Attention-Guided Multi-Scale Segmentation Neural Network for Interactive Extraction of Region Objects from High-Resolution Satellite Imagery
    Li, Kun
    Hu, Xiangyun
    Jiang, Huiwei
    Shu, Zhen
    Zhang, Mi
    REMOTE SENSING, 2020, 12 (05)
  • [45] SMANet: Superpixel-guided multi-scale attention network for medical image segmentation
    Shen, Yiwei
    Guo, Junchen
    Liu, Yan
    Xu, Chang
    Li, Qingwu
    Qi, Fei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [46] Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation
    Zhou, Yan
    Guo, Jie
    Sun, Hao
    Song, Bin
    Yu, Fei Richard
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1816 - 1820
  • [47] Camouflaged Object Detection Based on Deep Learning with Attention-Guided Edge Detection and Multi-Scale Context Fusion
    Wen, Yalin
    Ke, Wei
    Sheng, Hao
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [48] Multi-Scale Spatial Attention-Guided Monocular Depth Estimation With Semantic Enhancement
    Xu, Xianfa
    Chen, Zhe
    Yin, Fuliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8811 - 8822
  • [49] MAIL: Multi-Scale Attention-Guided Indoor Localization Using Geomagnetic Sequences
    Niu, Qun
    He, Tao
    Liu, Ning
    He, Suining
    Luo, Xiaonan
    Zhou, Fan
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (02):
  • [50] A hybrid attention multi-scale fusion network for real-time semantic segmentation
    Ye, Baofeng
    Xue, Renzheng
    Wu, Qianlong
    SCIENTIFIC REPORTS, 2025, 15 (01):