U-TransCNN: A U-shape transformer-CNN fusion model for underwater image enhancement

Cited by: 0

Authors:

Yao, Haiyang [1]

Guo, Ruige [1]

Zhao, Zhongda [4]

Zang, Yuzhang [2]

Zhao, Xiaobo [3]

Lei, Tao [1]

Wang, Haiyan [1,4]

Affiliations:

[1] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710016, Peoples R China

[2] Western Washington Univ, Engn & Design Dept, Bellingham, WA USA

[3] Aarhus Univ, Dept Elect & Comp Engn, DK-8200 Aarhus, Denmark

[4] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China

Source:

DISPLAYS | 2025 / Vol. 88

Keywords:

Underwater image enhancement; Feature fusion; Transformer; CNN;

DOI:

10.1016/j.displa.2025.103047

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Underwater imaging faces significant challenges due to nonuniform optical absorption and scattering, resulting in visual quality issues like color distortion, contrast reduction, and image blurring. These factors hinder the accurate capture and clear depiction of underwater imagery. To address these complexities, we propose U-TransCNN, a U-shape Transformer-Convolutional Neural Network (CNN) model, designed to enhance underwater images by integrating the strengths of CNNs and Transformers. The core of U-TransCNN is the Global-Detail Feature Synchronization Fusion Module. This component enhances global color and contrast while preserving intricate texture details, ensuring that both macroscopic and microscopic aspects of the image are enhanced in unison. We then design the Multiscale Detail Fusion Block to aggregate a richer spectrum of feature information using a variety of convolution kernels. Furthermore, our optimization strategy is augmented with a joint loss function, a dynamic approach allowing the model to assign varying weights to the loss associated with different pixel points, depending on their loss magnitude. Six experiments (including reference and non-reference) on three public underwater datasets confirm that U-TransCNN comprehensively surpasses other contemporary state-of-the-art deep learning algorithms, demonstrating marked improvement in visualization quality and quantization parameters of underwater images. Our code is available at https://github.com/GuoRuige/UTransCNN.
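
The abstract does not state the form of the joint loss, so the following is only an illustrative sketch of one way per-pixel weights could depend on loss magnitude. The function name `magnitude_weighted_l1`, the L1 base error, and the normalization by total error are all assumptions made for illustration and are not taken from the paper or its repository.

```python
def magnitude_weighted_l1(pred, target, eps=1e-8):
    """Hypothetical sketch: L1 loss where each pixel's weight is its share
    of the total absolute error (an assumed scheme, not the paper's)."""
    # Per-pixel absolute error over flattened images (plain lists of floats).
    errors = [abs(p - t) for p, t in zip(pred, target)]
    total = sum(errors)
    if total < eps:
        # No measurable error anywhere: nothing to weight.
        return 0.0
    # Pixels with larger error receive proportionally larger weight.
    weights = [e / total for e in errors]
    return sum(w * e for w, e in zip(weights, errors))


def mean_l1(pred, target):
    """Plain unweighted mean absolute error, for comparison."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)
```

Under this assumed scheme, a single badly reconstructed pixel dominates the loss instead of being averaged away, while uniformly distributed error gives the same value as the plain mean.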


Pages: 13

50 items in total

[11] LightingFormer: Transformer-CNN hybrid network for low-light image enhancement
Bi, Cong
Qian, Wenhua
Cao, Jinde
Wang, Xue
COMPUTERS & GRAPHICS-UK, 2024, 124
[12] TCPCNet: a transformer-CNN parallel cooperative network for low-light image enhancement
Zhang, Wanjun
Ding, Yujie
Zhang, Miaohui
Zhang, Yonghua
Cao, Lvchen
Huang, Ziqing
Wang, Jun
Multimedia Tools and Applications, 2024, 83 : 52957 - 52972
[13] Enhancement of Underwater Images through Parallel Fusion of Transformer and CNN
Liu, Xiangyong
Chen, Zhixin
Xu, Zhiqiang
Zheng, Ziwei
Ma, Fengshuang
Wang, Yunjie
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (09)
[14] U-SAS: U-Shape Network With Multilevel Enhancement and Global Decoding for Synthetic Aperture Sonar Image Semantic Segmentation
Li, Jiayuan
Wang, Zhen
You, Zhuhong
Zhao, Zhengyang
Yuan, Zhanbin
IEEE SENSORS JOURNAL, 2025, 25 (01) : 1799 - 1813
[15] Underwater Image Enhancement Based on Parallel Guidance of Transformer and CNN
Chang, Jian
Chen, Hongfu
Wang, Bingbing
Computer Engineering and Applications, 2024, 60 (04) : 280 - 288
[16] EFFICIENT U-SHAPE INVERTIBLE NEURAL NETWORK FOR IMAGE STEGANOGRAPHY
Zhang, Le
Li, Tong
Lu, Yao
Hou, Mixiao
Lu, Guangming
2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
[17] TCPCNet: a transformer-CNN parallel cooperative network for low-light image enhancement
Zhang, Wanjun
Ding, Yujie
Zhang, Miaohui
Zhang, Yonghua
Cao, Lvchen
Huang, Ziqing
Wang, Jun
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 52957 - 52972
[18] Multi-Scale U-Shape MLP for Hyperspectral Image Classification
Lin, Moule
Jing, Weipeng
Di, Donglin
Chen, Guangsheng
Song, Houbing
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[19] TCIA: A Transformer-CNN Model With Illumination Adaptation for Enhancing Cell Image Saliency and Contrast
Yang, Jietao
Huang, Guoheng
Luo, Yanzhang
Zhang, Xiaofeng
Yuan, Xiaochen
Chen, Xuhang
Pun, Chi-Man
Cai, Mu-Yan
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[20] Dr-SAM: U-Shape Structure Segment Anything Model for Generalizable Medical Image Segmentation
Huo, Xiangzuo
Tian, Shengwei
Zhou, Bingming
Yu, Long
Li, Aolun
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VII, ICIC 2024, 2024, 14868 : 197 - 207
