A transformer-CNN parallel network for image guided depth completion

Cited by: 5
Authors
Li, Tao [1]
Dong, Xiucheng [1]
Lin, Jie [2]
Peng, Yonghong [3]
Affiliations
[1] Xihua Univ, Sch Elect Engn & Elect Informat, Chengdu 610039, Peoples R China
[2] Xihua Univ, Sch Aeronaut & Astronaut, Chengdu 610039, Peoples R China
[3] Manchester Metropolitan Univ, Dept Comp & Math, Manchester M1 5GD, England
Funding
National Natural Science Foundation of China
Keywords
Depth completion; Convolutional neural network; Transformer; Token correlation; Conditional random field;
DOI
10.1016/j.patcog.2024.110305
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Image guided depth completion aims to predict a dense depth map from sparse depth measurements and the corresponding single color image. However, most state-of-the-art methods only rely on convolutional neural network (CNN) or transformer. In this paper, we propose a transformer-CNN parallel network (TCPNet) to integrate the advantages of CNN in local detail recovery and transformer in long-range semantic modeling. Specifically, our CNN branch adopts dense connection to strengthen feature propagation. Since the common transformer computes self-attention based on all the tokens in the window, no matter if they are relevant or not, this will inevitably introduce interferences and noises. To improve the self-attention accuracy, we propose a correlation-based transformer to only allow nearest neighbor tokens to participate in the self-attention computation. We also design a multi-scale conditional random field (CRF) module to implement multi-scale high-dimensional filtering for depth refinement. The comprehensive experimental results on KITTI and NYUv2 demonstrate that our method outperforms the state-of-the-art methods.
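The idea of letting only nearest-neighbor tokens take part in self-attention can be illustrated with a small sketch. This is not the TCPNet implementation: the abstract does not say how neighbors are selected, so the sketch assumes they are the k tokens with the highest scaled dot-product correlation to each query. The function name `topk_self_attention` and its parameters are hypothetical, and pure Python lists stand in for tensors.

```python
import math


def topk_self_attention(queries, keys, values, k):
    """Self-attention where each query attends only to its k most
    correlated keys. ASSUMPTION: "nearest neighbor" is read here as
    top-k by scaled dot product; the paper may define it differently."""
    scale = 1.0 / math.sqrt(len(queries[0]))
    outputs = []
    for q in queries:
        # Correlation score between this query and every key token.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, key)) for key in keys]
        # Indices of the k highest-scoring tokens; the rest are dropped.
        keep = sorted(range(len(keys)), key=lambda j: scores[j], reverse=True)[:k]
        # Numerically stable softmax over the kept tokens only.
        peak = max(scores[j] for j in keep)
        weights = {j: math.exp(scores[j] - peak) for j in keep}
        total = sum(weights.values())
        # Weighted sum of the kept value vectors.
        out = [
            sum(weights[j] / total * values[j][d] for j in keep)
            for d in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs
```

With k equal to the number of tokens this reduces to ordinary softmax attention over the whole window; smaller k removes weakly correlated tokens from the weighted sum, which is the effect the abstract attributes to the correlation-based transformer.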
Pages: 11
Related papers
50 in total
  • [41] Heterogeneous feature-aware Transformer-CNN coupling network for person re-identification
    Li, Yanchao
    Lian, Guoyun
    Zhang, Wenyu
    Ma, Guanglin
    Ren, Jin
    Yang, Jinfeng
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [42] Multi-scale Transformer-CNN domain adaptation network for complex processes fault diagnosis
    Zhu, Qun-Xiong
    Qian, Yu-Shi
    Zhang, Ning
    He, Yan-Lin
    Xu, Yuan
    JOURNAL OF PROCESS CONTROL, 2023, 130
  • [43] A novel hybrid transformer-CNN architecture for environmental microorganism classification
    Shao, Ran
    Bi, Xiao-Jun
    Chen, Zheng
    PLOS ONE, 2022, 17 (11)
  • [44] AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion
    Chen, Dongyue
    Huang, Tingxuan
    Song, Zhimin
    Deng, Shizhuo
    Jia, Tong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023 : 8819 - 8828
  • [45] Underwater Image Enhancement Based on Parallel Guidance of Transformer and CNN
    Chang, Jian
    Chen, Hongfu
    Wang, Bingbing
    Computer Engineering and Applications, 2024, 60 (04) : 280 - 288
  • [46] Hierarchical Decoder with Parallel Transformer and CNN for Medical Image Segmentation
    Li, Shijie
    Gong, Yu
    Xiang, Qingyuan
    Li, Zheng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 133 - 147
  • [47] FOTCA: hybrid transformer-CNN architecture using AFNO for accurate plant leaf disease image recognition
    Hu, Bo
    Jiang, Wenqian
    Zeng, Juan
    Cheng, Chen
    He, Laichang
    FRONTIERS IN PLANT SCIENCE, 2023, 14
  • [48] U-TransCNN: A U-shape transformer-CNN fusion model for underwater image enhancement
    Yao, Haiyang
    Guo, Ruige
    Zhao, Zhongda
    Zang, Yuzhang
    Zhao, Xiaobo
    Lei, Tao
    Wang, Haiyan
    DISPLAYS, 2025, 88
  • [49] CNN and Transformer interaction network for hyperspectral image classification
    Li, Zhongwei
    Huang, Wenhao
    Wang, Leiquan
    Xin, Ziqi
    Meng, Qiao
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (18) : 5548 - 5573
  • [50] Transformer-CNN Automatic Hyperparameter Tuning for Speech Emotion Recognition
    Gumelar, Agustinus Bimo
    Yuniarno, Eko Mulyanto
    Adi, Derry Pramono
    Setiawan, Rudi
    Sugiarto, Indar
    Purnomo, Mauridhi Hery
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST 2022), 2022