Diffusion model-based text-guided enhancement network for medical image segmentation

被引：7

作者：

Dong, Zhiwei ^{[1
]}

Yuan, Genji ^{[1
]}

Hua, Zhen ^{[1
]}

Li, Jinjiang ^{[2
]}

机构：

[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China

[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 249卷

基金：

中国国家自然科学基金;

关键词：

Denoising diffusion model; Text attention mechanism; Guided feature enhancement; Medical image segmentation; CONVOLUTIONAL NEURAL-NETWORK; CELL-NUCLEI; MISDIAGNOSIS; CLASSIFICATION;

D O I：

10.1016/j.eswa.2024.123549

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, denoising diffusion models have achieved remarkable success in generating pixel-level representations with semantic values for image generation modeling. In this study, we propose a novel end -toend framework, called TGEDiff, focusing on medical image segmentation. TGEDiff fuses a textual attention mechanism with the diffusion model by introducing an additional auxiliary categorization task to guide the diffusion model with textual information to generate excellent pixel-level representations. To overcome the limitation of limited perceptual fields for independent feature encoders within the diffusion model, we introduce a multi-kernel excitation module to extend the model's perceptual capability. Meanwhile, a guided feature enhancement module is introduced in Denoising-UNet to focus the model's attention on important regions and attenuate the influence of noise and irrelevant background in medical images. We critically evaluated TGEDiff on three datasets (Kvasir-SEG, Kvasir-Sessile, and GLaS), and TGEDiff achieved significant improvements over the state -of -the -art approach on all three datasets, with F1 scores and mIoU improving by 0.88% and 1.09%, 3.21% and 3.43%, respectively, 1.29% and 2.34%. These data validate that TGEDiff has excellent performance in medical image segmentation. TGEDiff is expected to facilitate accurate diagnosis and treatment of medical diseases through more precise deconvolutional structural segmentation.

引用

页数：18

共 50 条

[1] Enhancing Label-Efficient Medical Image Segmentation with Text-Guided Diffusion Models
Feng, Chun-Mei
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 253 - 262
[2] ABP: Asymmetric Bilateral Prompting for Text-Guided Medical Image Segmentation
Zeng, Xinyi
Zeng, Pinxian
Cui, Jiaqi
Li, Aibing
Liu, Bo
Wang, Chengdi
Wang, Yan
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 54 - 64
[3] GENERATIVE ADVERSARIAL NETWORK INCLUDING REFERRING IMAGE SEGMENTATION FOR TEXT-GUIDED IMAGE MANIPULATION
Watanabe, Yuto
Togo, Ren
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4818 - 4822
[4] Text-Guided Cross-Position Attention for Segmentation: Case of Medical Image
Lee, Go-Eun
Kim, Seon Ho
Cho, Jungchan
Choi, Sang Tae
Choi, Sang-Il
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT V, 2023, 14224 : 537 - 546
[5] Text-Guided Multi-region Scene Image Editing Based on Diffusion Model
Li, Ruichen
Wu, Lei
Wang, Changshuo
Dong, Pei
Li, Xin
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XI, ICIC 2024, 2024, 14872 : 229 - 240
[6] SEGMENTATION-AWARE TEXT-GUIDED IMAGE MANIPULATION
Haruyama, Tomoki
Togo, Ren
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2433 - 2437
[7] Text-Guided Image Manipulation via Generative Adversarial Network With Referring Image Segmentation-Based Guidance
Watanabe, Yuto
Togo, Ren
Maeda, Keisuke
Ogawa, Takahiro
Haseyama, Miki
IEEE ACCESS, 2023, 11 : 42534 - 42545
[8] Text-guided image-to-sketch diffusion models☆
Ke, Aihua
Huang, Yujie
Cai, Bo
Yang, Jie
KNOWLEDGE-BASED SYSTEMS, 2024, 304
[9] Common Vision-Language Attention for Text-Guided Medical Image Segmentation of Pneumonia
Guo, Yunpeng
Zeng, Xinyi
Zeng, Pinxian
Fei, Yuchen
Wen, Lu
Zhou, Jiliu
Wang, Yan
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 192 - 201
[10] Text-Guided Attention Model for Image Captioning
Mun, Jonghwan
Cho, Minsu
Han, Bohyung
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4233 - 4239

← 1 2 3 4 5 →