Kernel Transformer Networks for Compact Spherical Convolution

被引:73
|
作者
Su, Yu-Chuan [1 ]
Grauman, Kristen [1 ,2 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Facebook AI Res, New York, NY USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ideally, 360 degrees imagery could inherit the deep convolutional neural networks (CNNs) already trained with great success on perspective projection images. However, existing methods to transfer CNNs from perspective to spherical images introduce significant computational costs and/or degradations in accuracy. We present the Kernel Transformer Network (KTN) to efficiently transfer convolution kernels from perspective images to the equirectangular projection of 360 degrees images. Given a source CNN for perspective images as input, the KTN produces a function parameterized by a polar angle and kernel as output. Given a novel 360 degrees image, that function in turn can compute convolutions for arbitrary layers and kernels as would the source CNN on the corresponding tangent plane projections. Distinct from all existing methods, KTNs allow model transfer: the same model can be applied to different source CNNs with the same base architecture. This enables application to multiple recognition tasks without re-training the KTN. Validating our approach with multiple source CNNs and datasets, we show that KTNs improve the state of the art for spherical convolution. KTNs successfully preserve the source CNN's accuracy, while offering transferability, scalability to typical image resolutions, and, in many cases, a substantially lower memory footprint(1).
引用
收藏
页码:9434 / 9443
页数:10
相关论文
共 50 条
  • [21] A novel Transformer-based model with large kernel temporal convolution for chemical process fault detection
    Zhu, Zhichao
    Chen, Feiyang
    Ni, Lei
    Bian, Haitao
    Jiang, Juncheng
    Chen, Zhiquan
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 188
  • [22] Transformer with large convolution kernel decoder network for salient object detection in optical remote sensing images
    Dong, Pengwei
    Wang, Bo
    Cong, Runmin
    Sun, Hai-Han
    Li, Chongyi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [23] CONVOLUTION AND DECONVOLUTION WITH GAUSSIAN KERNEL
    FANG, TM
    SHEI, SS
    NAGEM, RJ
    SANDRI, GV
    NUOVO CIMENTO DELLA SOCIETA ITALIANA DI FISICA B-GENERAL PHYSICS RELATIVITY ASTRONOMY AND MATHEMATICAL PHYSICS AND METHODS, 1994, 109 (01): : 83 - 92
  • [24] Introducing frequency representation into convolution neural networks for medical image segmentation via twin-Kernel Fourier convolution
    Tang, Xianlun
    Peng, Jiangping
    Zhong, Bing
    Li, Jie
    Yan, Zhenfu
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 205
  • [25] KPConvX: Modernizing Kernel Point Convolution with Kernel Attention
    Thomas, Hugues
    Tsai, Yao-Hung Hubert
    Barfoot, Timothy D.
    Zhang, Jian
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 5525 - 5535
  • [26] Transformer-based cascade networks with spatial and channel reconstruction convolution for deepfake detection
    Li, Xue
    Zhou, Huibo
    Zhao, Ming
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (03) : 4142 - 4164
  • [27] Synergistic spectral and spatial feature analysis with transformer and convolution networks for hyperspectral image classification
    Yadav, Dhirendra Prasad
    Kumar, Deepak
    Jalal, Anand Singh
    Kumar, Ankit
    Kada, B.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 2975 - 2990
  • [28] MCT-TTE: Travel Time Estimation Based on Transformer and Convolution Neural Networks
    Liu, Fengkai
    Yang, Jianhua
    Li, Mu
    Wang, Kuo
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [29] Graph transformer based dynamic multiple graph convolution networks for traffic flow forecasting
    Hu, Yongli
    Peng, Ting
    Guo, Kan
    Sun, Yanfeng
    Gao, Junbin
    Yin, Baocai
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (09) : 1835 - 1845
  • [30] Synergistic spectral and spatial feature analysis with transformer and convolution networks for hyperspectral image classification
    Dhirendra Prasad Yadav
    Deepak Kumar
    Anand Singh Jalal
    Ankit Kumar
    B. Kada
    Signal, Image and Video Processing, 2024, 18 : 2975 - 2990