Kernel Transformer Networks for Compact Spherical Convolution

被引:73
|
作者
Su, Yu-Chuan [1 ]
Grauman, Kristen [1 ,2 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Facebook AI Res, New York, NY USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ideally, 360 degrees imagery could inherit the deep convolutional neural networks (CNNs) already trained with great success on perspective projection images. However, existing methods to transfer CNNs from perspective to spherical images introduce significant computational costs and/or degradations in accuracy. We present the Kernel Transformer Network (KTN) to efficiently transfer convolution kernels from perspective images to the equirectangular projection of 360 degrees images. Given a source CNN for perspective images as input, the KTN produces a function parameterized by a polar angle and kernel as output. Given a novel 360 degrees image, that function in turn can compute convolutions for arbitrary layers and kernels as would the source CNN on the corresponding tangent plane projections. Distinct from all existing methods, KTNs allow model transfer: the same model can be applied to different source CNNs with the same base architecture. This enables application to multiple recognition tasks without re-training the KTN. Validating our approach with multiple source CNNs and datasets, we show that KTNs improve the state of the art for spherical convolution. KTNs successfully preserve the source CNN's accuracy, while offering transferability, scalability to typical image resolutions, and, in many cases, a substantially lower memory footprint(1).
引用
收藏
页码:9434 / 9443
页数:10
相关论文
共 50 条
  • [41] A FPGA implementation of variable kernel convolution
    Sriram, Vinay
    Kearney, David
    EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2007, : 105 - 109
  • [42] Time Series Convolution Kernel Estimation
    Dvorak, Marek
    INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM 2017), 2018, 1978
  • [43] Human motion recognition with a convolution kernel
    Cao, Dongwei
    Masoud, Osama T.
    Boley, Daniel
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 4270 - +
  • [44] A convolution kernel method for color recognition
    Son, Jeong-Woo
    Park, Seong-Bae
    Kim, Ku-Jin
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 242 - +
  • [45] Breaking Barriers in Cancer Diagnosis: Super-Light Compact Convolution Transformer for Colon and Lung Cancer Detection
    Maurya, Ritesh
    Pandey, Nageshwar Nath
    Karnati, Mohan
    Sahu, Geet
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)
  • [46] Transformer with Convolution for Irregular Image Inpainting
    Xie, Donglin
    Wang, Lingfeng
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 35 - 38
  • [47] MDCT: Multi-Kernel Dilated Convolution and Transformer for One-Stage Object Detection of Remote Sensing Images
    Chen, Juanjuan
    Hong, Hansheng
    Song, Bin
    Guo, Jie
    Chen, Chen
    Xu, Junjie
    REMOTE SENSING, 2023, 15 (02)
  • [48] Optimization of optical convolution kernel of optoelectronic hybrid convolution neural network
    XU Xiaofeng
    ZHU Lianqing
    ZHUANG Wei
    ZHANG Dongliang
    LU Lidan
    YUAN Pei
    Optoelectronics Letters, 2022, 18 (03) : 181 - 186
  • [49] Optimization of optical convolution kernel of optoelectronic hybrid convolution neural network
    Xiaofeng Xu
    Lianqing Zhu
    Wei Zhuang
    Dongliang Zhang
    Lidan Lu
    Pei Yuan
    Optoelectronics Letters, 2022, 18 : 181 - 186
  • [50] Speech Emotion Recognition Using Convolution Neural Networks and Multi-Head Convolutional Transformer
    Ullah, Rizwan
    Asif, Muhammad
    Shah, Wahab Ali
    Anjam, Fakhar
    Ullah, Ibrar
    Khurshaid, Tahir
    Wuttisittikulkij, Lunchakorn
    Shah, Shashi
    Ali, Syed Mansoor
    Alibakhshikenari, Mohammad
    SENSORS, 2023, 23 (13)