TM-GAN: A Transformer-Based Multi-Modal Generative Adversarial Network for Guided Depth Image Super-Resolution

被引:1
|
作者
Zhu, Jiang [1 ]
Koh, Van Kwan Zhi [1 ]
Lin, Zhiping [1 ]
Wen, Bihan [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 39798, Singapore
关键词
Transformers; Superresolution; Generative adversarial networks; Convolutional neural networks; Task analysis; Spatial resolution; Image reconstruction; Depth images; guided image super-resolution; vision transformer; generative adversarial network; RGB-D; MAP SUPERRESOLUTION; FUSION; 3D;
D O I
10.1109/JETCAS.2024.3394495
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite significant strides in deep single image super-resolution (SISR), the development of robust guided depth image super-resolution (GDSR) techniques presents a notable challenge. Effective GDSR methods must not only exploit the properties of the target image but also integrate complementary information from the guidance image. The state-of-the-art in guided image super-resolution has been dominated by convolutional neural network (CNN) based methods, which leverage CNN as their architecture. However, CNN has limitations in capturing global information effectively, and their traditional regression training techniques can sometimes lead to challenges in the precise generating of high-frequency details, unlike transformers that have shown remarkable success in deep learning through the self-attention mechanism. Drawing inspiration from the transformative impact of transformers in both language and vision applications, we propose a Transformer-based Multi-modal Generative Adversarial Network dubbed TM-GAN. TM-GAN is designed to effectively process and integrate multi-modal data, leveraging the global contextual understanding and detailed feature extraction capabilities of transformers within a GAN architecture for GDSR, aiming to effectively integrate and utilize multi-modal data sources. Experimental evaluations of TM-GAN on a variety of RGB-D datasets demonstrate its superiority over the state-of-the-art methods, showcasing its effectiveness in leveraging transformer-based techniques for GDSR.
引用
收藏
页码:261 / 274
页数:14
相关论文
共 50 条
  • [41] Learning Multi-Modal Cross-Scale Deformable Transformer Network for Unregistered Hyperspectral Image Super-resolution
    Dong, Wenqian
    Xu, Yang
    Qu, Jiahui
    Hou, Shaoxiong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1573 - 1581
  • [42] Depth Map Upsampling via Multi-Modal Generative Adversarial Network
    Tan, Daniel Stanley
    Lin, Jun-Ming
    Lai, Yu-Chi
    Ilao, Joel
    Hua, Kai-Lung
    SENSORS, 2019, 19 (07)
  • [43] Multi-Modal Prior-Guided Diffusion Model for Blind Image Super-Resolution
    Huang, Detian
    Song, Jiaxun
    Huang, Xiaoqian
    Hu, Zhenzhen
    Zeng, Huanqiang
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 316 - 320
  • [44] Latent Edge Guided Depth Super-Resolution Using Attention-Based Hierarchical Multi-Modal Fusion
    Lan, Hui
    Jung, Cheolkon
    IEEE ACCESS, 2024, 12 : 114512 - 114526
  • [45] Multi-modal Image Fusion for Multispectral Super-resolution in Microscopy
    Dey, Neel
    Li, Shijie
    Bermond, Katharina
    Heintzmann, Rainer
    Curcio, Christine A.
    Ach, Thomas
    Gerig, Guido
    MEDICAL IMAGING 2019: IMAGE PROCESSING, 2019, 10949
  • [46] A Transformer-Unet Generative Adversarial Network for the Super-Resolution Reconstruction of DEMs
    Zheng, Xin
    Xu, Zhaoqi
    Yin, Qian
    Bao, Zelun
    Chen, Zhirui
    Wang, Sizhu
    REMOTE SENSING, 2024, 16 (19)
  • [47] Generative adversarial network in wavelet domain for single image super-resolution
    Zhang, Fan
    Wang, Xinwei
    Cao, Lin
    Du, Kangning
    Guo, Yanan
    Journal of Computers (Taiwan), 2021, 32 (03) : 249 - 262
  • [48] CSRGAN: MEDICAL IMAGE SUPER-RESOLUTION USING A GENERATIVE ADVERSARIAL NETWORK
    Zhu, Yongpei
    Zhou, Zicong
    Liao, Guojun
    Yuan, Kehong
    2020 IEEE 17TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING WORKSHOPS (IEEE ISBI WORKSHOPS 2020), 2020,
  • [49] Generative Adversarial Network for Image Super-Resolution Combining Texture Loss
    Jiang, Yuning
    Li, Jinhua
    APPLIED SCIENCES-BASEL, 2020, 10 (05):
  • [50] Dual Discriminator Generative Adversarial Network for Single Image Super-Resolution
    Liu, Peng
    Hong, Ying
    Liu, Yan
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 680 - 687