Deformable convolutional networks for multi-view 3D shape classification

被引:12
|
作者
Ma, Pengfei [1 ]
Ma, Jie [1 ]
Wang, Xujiao [1 ]
Yang, Lichuang [1 ]
Wang, Nannan [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
关键词
learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;
D O I
10.1049/el.2018.6851
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This Letter suggests a novel method for improving the robustness and the geometric transformation modelling capability in multi-view convolutional networks (MVCNNs). First, the deformable convolutional networks are used to learn more details and features related to the geometric transformation which the standard convolutional neural networks cannot handle. Then a view-pooling layer is specifically designed for combining the descriptors from multiple views as the final representations of the 3D shapes. The key idea is to insert the deformable convolutional layer between the input and convolutional layer, making it possible to solve deformable 3D shape classification problems, which was a challenging task for MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40) over previous methods by a significant margin.
引用
收藏
页码:1373 / 1374
页数:2
相关论文
共 50 条
  • [1] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [2] Multi-view SoftPool attention convolutional networks for 3D model classification
    Wang, Wenju
    Wang, Xiaolin
    Chen, Gang
    Zhou, Haoran
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [3] 3D multi-view convolutional neural networks for lung nodule classification
    Kang, Guixia
    Liu, Kui
    Hou, Beibei
    Zhang, Ningbo
    PLOS ONE, 2017, 12 (11):
  • [4] 3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks
    Lun, Zhaoliang
    Gadelha, Matheus
    Kalogerakis, Evangelos
    Maji, Subhransu
    Wang, Rui
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 67 - 77
  • [5] Joint Multi-view 2D Convolutional Neural Networks for 3D Object Classification
    Xu, Jinglin
    Zhang, Xiangsen
    Li, Wenbin
    Liu, Xinwang
    Han, Junwei
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3202 - 3208
  • [6] Multi-view Fusion with Deep Learning for 3D Shape Classification
    Huang, Xiang
    Wang, Mantao
    Zhang, Dejun
    Zhu, Yu
    Zou, Lu
    Sun, Jun
    Han, Fei
    He, Linchao
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 189 - 194
  • [7] Multi-View and 3D Deformable Part Models
    Pepik, Bojan
    Stark, Michael
    Gehler, Peter
    Schiele, Bernt
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (11) : 2232 - 2245
  • [8] Multi-view classification with convolutional neural networks
    Seeland, Marco
    Maeder, Patrick
    PLOS ONE, 2021, 16 (01):
  • [9] Multi-View Classification and 3D Bounding Box Regression Networks
    Pramerdorfer, Christopher
    Kampel, Martin
    Van Loock, Mark
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 734 - 739
  • [10] MULTI-VIEW GAIT RECOGNITION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Wolf, Thomas
    Babaee, Mohammadreza
    Rigoll, Gerhard
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4165 - 4169