Deformable convolutional networks for multi-view 3D shape classification

被引:12
|
作者
Ma, Pengfei [1 ]
Ma, Jie [1 ]
Wang, Xujiao [1 ]
Yang, Lichuang [1 ]
Wang, Nannan [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
关键词
learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;
D O I
10.1049/el.2018.6851
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This Letter suggests a novel method for improving the robustness and the geometric transformation modelling capability in multi-view convolutional networks (MVCNNs). First, the deformable convolutional networks are used to learn more details and features related to the geometric transformation which the standard convolutional neural networks cannot handle. Then a view-pooling layer is specifically designed for combining the descriptors from multiple views as the final representations of the 3D shapes. The key idea is to insert the deformable convolutional layer between the input and convolutional layer, making it possible to solve deformable 3D shape classification problems, which was a challenging task for MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40) over previous methods by a significant margin.
引用
收藏
页码:1373 / 1374
页数:2
相关论文
共 50 条
  • [21] Dynamic View Aggregation for Multi-View 3D Shape Recognition
    Zhou, Yuan
    Sun, Zhongqi
    Huo, Shuwei
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9163 - 9174
  • [22] PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition
    You, Haoxuan
    Feng, Yifan
    Ji, Rongrong
    Gao, Yue
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1310 - 1318
  • [23] MV-C3D: A Spatial Correlated Multi-View 3D Convolutional Neural Networks
    Xuan, Qi
    Li, Fuxian
    Liu, Yi
    Xiang, Yun
    IEEE ACCESS, 2019, 7 : 92528 - 92538
  • [24] 3D Shape Completion with Multi-View Consistent Inference
    Hu, Tao
    Han, Zhizhong
    Zwicker, Matthias
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10997 - 11004
  • [25] Multi-view expressive graph neural networks for 3D CAD model classification
    Li, Shuang
    Corney, Jonathan
    COMPUTERS IN INDUSTRY, 2023, 151
  • [26] Contrastive Multi-View Learning for 3D Shape Clustering
    Peng, Bo
    Lin, Guoting
    Lei, Jianjun
    Qin, Tianyi
    Cao, Xiaochun
    Ling, Nam
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6262 - 6272
  • [27] Improved weak texture multi-view 3D reconstruction algorithm based on deformable convolutions networks
    Peng, Bo
    Li, Yi
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2024, : 5495 - 5506
  • [28] Multi-View Convolutional Neural Networks for Mammographic Image Classification
    Sun, Lilei
    Wang, Junqian
    Hu, Zhijun
    Xu, Yong
    Cui, Zhongwei
    IEEE ACCESS, 2019, 7 : 126273 - 126282
  • [29] MVCLN: Multi-View Convolutional LSTM Network for Cross-Media 3D Shape Recognition
    Liang, Qi
    Wang, Yixin
    Nie, Weizhi
    Li, Qiang
    IEEE ACCESS, 2020, 8 : 139792 - 139802
  • [30] Multi-view convolutional vision transformer for 3D object recognition
    Li, Jie
    Liu, Zhao
    Li, Li
    Lin, Junqin
    Yao, Jian
    Tu, Jingmin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95