Deformable convolutional networks for multi-view 3D shape classification

被引:12
|
作者
Ma, Pengfei [1 ]
Ma, Jie [1 ]
Wang, Xujiao [1 ]
Yang, Lichuang [1 ]
Wang, Nannan [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
关键词
learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;
D O I
10.1049/el.2018.6851
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This Letter suggests a novel method for improving the robustness and the geometric transformation modelling capability in multi-view convolutional networks (MVCNNs). First, the deformable convolutional networks are used to learn more details and features related to the geometric transformation which the standard convolutional neural networks cannot handle. Then a view-pooling layer is specifically designed for combining the descriptors from multiple views as the final representations of the 3D shapes. The key idea is to insert the deformable convolutional layer between the input and convolutional layer, making it possible to solve deformable 3D shape classification problems, which was a challenging task for MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40) over previous methods by a significant margin.
引用
收藏
页码:1373 / 1374
页数:2
相关论文
共 50 条
  • [41] Multi-view 3D Models from Single Images with a Convolutional Network
    Tatarchenko, Maxim
    Dosovitskiy, Alexey
    Brox, Thomas
    COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 322 - 337
  • [42] Adaptive region aggregation for multi-view stereo matching using deformable convolutional networks
    Hu, Han
    Su, Liupeng
    Mao, Shunfu
    Chen, Min
    Pan, Guoqiang
    Xu, Bo
    Zhu, Qing
    PHOTOGRAMMETRIC RECORD, 2023, 38 (183): : 430 - 449
  • [43] Multi-view 3D Reconstruction Based on Deformable Convolution and Laplace Pyramid Residuals
    Hao, Zhaoming
    Zhang, Ziyang
    Li, Hongyan
    Xu, Baoqing
    Zhang, Xiaoqiong
    Xu, Meng
    Wang, Weifeng
    IAENG International Journal of Computer Science, 2024, 51 (07) : 896 - 905
  • [44] Deformable 3D Shape Classification Using 3D Racah Moments and Deep Neural Networks
    Lakhili, Zouhir
    El Alami, Abdelmajid
    Mesbah, Abderrahim
    Berrahou, Aissam
    Qjidaa, Hassan
    SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 : 12 - 20
  • [45] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
    Liu, Xianpeng
    Zheng, Ce
    Qian, Ming
    Xue, Nan
    Chen, Chen
    Zhang, Zhebin
    Li, Chen
    Wu, Tianfu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
  • [46] Multi-View 3D Video Delivery for Broadband IP Networks
    Ho, Ting-Yu
    Yeh, Yi-Nung
    Yang, De-Nian
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 5796 - 5802
  • [47] Multi-view facial landmark detection by using a 3D shape model
    Cech, Jan
    Franc, Vojtech
    Uricar, Michal
    Matas, Jiri
    IMAGE AND VISION COMPUTING, 2016, 47 : 60 - 70
  • [48] Multi-view Shape Generation for a 3D Human-like Body
    Yu, Hang
    Cheang, Chilam
    Fu, Yanwei
    Xue, Xiangyang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [49] Group Multi-View Transformer for 3D Shape Analysis With Spatial Encoding
    Xu, Lixiang
    Cui, Qingzhe
    Hong, Richang
    Xu, Wei
    Chen, Enhong
    Yuan, Xin
    Li, Chenglong
    Tang, Yuanyan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9450 - 9463
  • [50] Minimum variance estimation of 3D face shape from multi-view
    Zhang, Zhenqiu
    Hu, Yuxiao
    Yu, Tianli
    Huang, Thomas
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 547 - +