TM-GAN: A Transformer-Based Multi-Modal Generative Adversarial Network for Guided Depth Image Super-Resolution

被引:1
|
作者
Zhu, Jiang [1 ]
Koh, Van Kwan Zhi [1 ]
Lin, Zhiping [1 ]
Wen, Bihan [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 39798, Singapore
关键词
Transformers; Superresolution; Generative adversarial networks; Convolutional neural networks; Task analysis; Spatial resolution; Image reconstruction; Depth images; guided image super-resolution; vision transformer; generative adversarial network; RGB-D; MAP SUPERRESOLUTION; FUSION; 3D;
D O I
10.1109/JETCAS.2024.3394495
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite significant strides in deep single image super-resolution (SISR), the development of robust guided depth image super-resolution (GDSR) techniques presents a notable challenge. Effective GDSR methods must not only exploit the properties of the target image but also integrate complementary information from the guidance image. The state-of-the-art in guided image super-resolution has been dominated by convolutional neural network (CNN) based methods, which leverage CNN as their architecture. However, CNN has limitations in capturing global information effectively, and their traditional regression training techniques can sometimes lead to challenges in the precise generating of high-frequency details, unlike transformers that have shown remarkable success in deep learning through the self-attention mechanism. Drawing inspiration from the transformative impact of transformers in both language and vision applications, we propose a Transformer-based Multi-modal Generative Adversarial Network dubbed TM-GAN. TM-GAN is designed to effectively process and integrate multi-modal data, leveraging the global contextual understanding and detailed feature extraction capabilities of transformers within a GAN architecture for GDSR, aiming to effectively integrate and utilize multi-modal data sources. Experimental evaluations of TM-GAN on a variety of RGB-D datasets demonstrate its superiority over the state-of-the-art methods, showcasing its effectiveness in leveraging transformer-based techniques for GDSR.
引用
收藏
页码:261 / 274
页数:14
相关论文
共 50 条
  • [21] Terahertz image super-resolution restoration using a hybrid-Transformer-based generative adversarial network
    Wu, Heng
    Zheng, Jing
    He, Chunhua
    Xiao, Huapan
    Luo, Shaojuan
    OPTICS AND LASERS IN ENGINEERING, 2025, 189
  • [22] Fusformer: A Transformer-Based Fusion Network for Hyperspectral Image Super-Resolution
    Hu, Jin-Fan
    Huang, Ting-Zhu
    Deng, Liang-Jian
    Dou, Hong-Xia
    Hong, Danfeng
    Vivone, Gemine
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [23] DAW-GAN: a generative adversarial network based on the dynamic adaptive weight for image super-resolution
    Xia, Tingyu
    Yang, Xin
    Zhu, Yitian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (25) : 67199 - 67211
  • [24] FG-SRGAN: A Feature-Guided Super-Resolution Generative Adversarial Network for Unpaired Image Super-Resolution
    Lian, Shuailong
    Zhou, Hejian
    Sun, Yi
    ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT I, 2019, 11554 : 151 - 161
  • [25] Retinal fundus image super-resolution based on generative adversarial network guided with vascular structure prior
    Jia, Yanfei
    Chen, Guangda
    Chi, Haotian
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [26] Dual prior guided depth image super-resolution with multi-scale transformer fusion network
    Zhao, Pengfei
    Ji, Jianhua
    Wen, Yang
    Shi, Wuzhen
    Cao, Wenming
    VISUAL COMPUTER, 2025,
  • [27] Improved generative adversarial network for retinal image super-resolution
    Qiu, Defu
    Cheng, Yuhu
    Wang, Xuesong
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 225
  • [28] A lightweight generative adversarial network for single image super-resolution
    Lu, Xinbiao
    Xie, Xupeng
    Ye, Chunlin
    Xing, Hao
    Liu, Zecheng
    Cai, Changchun
    VISUAL COMPUTER, 2024, 40 (01): : 41 - 52
  • [29] Image super-resolution using conditional generative adversarial network
    Qiao, Jiaojiao
    Song, Huihui
    Zhang, Kaihua
    Zhang, Xiaolu
    Liu, Qingshan
    IET IMAGE PROCESSING, 2019, 13 (14) : 2673 - 2679
  • [30] MULTIRESOLUTION MIXTURE GENERATIVE ADVERSARIAL NETWORK FOR IMAGE SUPER-RESOLUTION
    Wang, Yudiao
    Lan, Xuguang
    Zhang, Yinshu
    Miao, Ruixue
    Tian, Zhiqiang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,