TM-GAN: A Transformer-Based Multi-Modal Generative Adversarial Network for Guided Depth Image Super-Resolution

被引:1
|
作者
Zhu, Jiang [1 ]
Koh, Van Kwan Zhi [1 ]
Lin, Zhiping [1 ]
Wen, Bihan [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 39798, Singapore
关键词
Transformers; Superresolution; Generative adversarial networks; Convolutional neural networks; Task analysis; Spatial resolution; Image reconstruction; Depth images; guided image super-resolution; vision transformer; generative adversarial network; RGB-D; MAP SUPERRESOLUTION; FUSION; 3D;
D O I
10.1109/JETCAS.2024.3394495
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite significant strides in deep single image super-resolution (SISR), the development of robust guided depth image super-resolution (GDSR) techniques presents a notable challenge. Effective GDSR methods must not only exploit the properties of the target image but also integrate complementary information from the guidance image. The state-of-the-art in guided image super-resolution has been dominated by convolutional neural network (CNN) based methods, which leverage CNN as their architecture. However, CNN has limitations in capturing global information effectively, and their traditional regression training techniques can sometimes lead to challenges in the precise generating of high-frequency details, unlike transformers that have shown remarkable success in deep learning through the self-attention mechanism. Drawing inspiration from the transformative impact of transformers in both language and vision applications, we propose a Transformer-based Multi-modal Generative Adversarial Network dubbed TM-GAN. TM-GAN is designed to effectively process and integrate multi-modal data, leveraging the global contextual understanding and detailed feature extraction capabilities of transformers within a GAN architecture for GDSR, aiming to effectively integrate and utilize multi-modal data sources. Experimental evaluations of TM-GAN on a variety of RGB-D datasets demonstrate its superiority over the state-of-the-art methods, showcasing its effectiveness in leveraging transformer-based techniques for GDSR.
引用
收藏
页码:261 / 274
页数:14
相关论文
共 50 条
  • [1] Spatial Transformer Generative Adversarial Network for Image Super-Resolution
    Rempakos, Pantelis
    Vrigkas, Michalis
    Plissiti, Marina E.
    Nikou, Christophoros
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 399 - 411
  • [2] Information sparsity guided transformer for multi-modal medical image super-resolution
    Lu, Haotian
    Mei, Jie
    Qiu, Yu
    Li, Yumeng
    Hao, Fangwei
    Xu, Jing
    Tang, Lin
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 261
  • [3] Spatial Transformer Generative Adversarial Network for Robust Image Super-Resolution
    Kasem, Hossam M.
    Hung, Kwok-Wai
    Jiang, Jianmin
    IEEE ACCESS, 2019, 7 : 182993 - 183009
  • [4] Degradation-Guided Multi-Modal Fusion Network for Depth Map Super-Resolution
    Han, Lu
    Wang, Xinghu
    Zhou, Fuhui
    Wu, Diansheng
    ELECTRONICS, 2024, 13 (20)
  • [5] An Iris Image Super-Resolution Model Based on Swin Transformer and Generative Adversarial Network
    Lu, Hexin
    Zhu, Xiaodong
    Cui, Jingwei
    Jiang, Haifeng
    ALGORITHMS, 2024, 17 (03)
  • [6] Image Super-Resolution Reconstruction Based on a Generative Adversarial Network
    Wu, Yun
    Lan, Lin
    Long, Huiyun
    Kong, Guangqian
    Duan, Xun
    Xu, Changzhuan
    IEEE ACCESS, 2020, 8 : 215133 - 215144
  • [7] Image super-resolution based on conditional generative adversarial network
    Gao, Hongxia
    Chen, Zhanhong
    Huang, Binyang
    Chen, Jiahe
    Li, Zhifu
    IET IMAGE PROCESSING, 2020, 14 (13) : 3006 - 3013
  • [8] Mars image super-resolution based on generative adversarial network
    Wang, Cong
    Zhang, Yin
    Zhang, Yongqiang
    Tian, Rui
    Ding, Mingli
    Zhang, Yongqiang (yongqiang.zhang.hit@gmail.com); Ding, Mingli (mingli.ding.hit@gmail.com), 1600, Institute of Electrical and Electronics Engineers Inc. (09): : 108889 - 108898
  • [9] Image Super-resolution Reconstructing based on Generative Adversarial Network
    Nan Jing
    Bo Lei
    AI IN OPTICS AND PHOTONICS (AOPC 2019), 2019, 11342
  • [10] Mars Image Super-Resolution Based on Generative Adversarial Network
    Wang, Cong
    Zhang, Yin
    Zhang, Yongqiang
    Tian, Rui
    Ding, Mingli
    IEEE ACCESS, 2021, 9 : 108889 - 108898