Deformable convolutional networks for multi-view 3D shape classification

被引：12

作者：

Ma, Pengfei ^{[1
]}

Ma, Jie ^{[1
]}

Wang, Xujiao ^{[1
]}

Yang, Lichuang ^{[1
]}

Wang, Nannan ^{[1
]}

机构：

[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China

来源：

ELECTRONICS LETTERS | 2018年 / 54卷 / 24期

关键词：

learning (artificial intelligence); image classification; feature extraction; image representation; feedforward neural nets; computational geometry; deformable convolutional networks; multiview 3D shape classification; geometric transformation modelling capability; multiview convolutional networks; view-pooling layer; deformable convolutional layer; input; deformable 3D shape classification problems; MVCNN framework; ModelNet10; dataset; ModelNet40;

D O I：

10.1049/el.2018.6851

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This Letter suggests a novel method for improving the robustness and the geometric transformation modelling capability in multi-view convolutional networks (MVCNNs). First, the deformable convolutional networks are used to learn more details and features related to the geometric transformation which the standard convolutional neural networks cannot handle. Then a view-pooling layer is specifically designed for combining the descriptors from multiple views as the final representations of the 3D shapes. The key idea is to insert the deformable convolutional layer between the input and convolutional layer, making it possible to solve deformable 3D shape classification problems, which was a challenging task for MVCNN framework. The proposed method achieves state-of-the-art classification results on two subsets of the ModelNet dataset (ModelNet10 and ModelNet40) over previous methods by a significant margin.

引用

页码：1373 / 1374

页数：2

共 50 条

[41] Multi-view 3D Models from Single Images with a Convolutional Network
Tatarchenko, Maxim
Dosovitskiy, Alexey
Brox, Thomas
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 322 - 337
[42] Adaptive region aggregation for multi-view stereo matching using deformable convolutional networks
Hu, Han
Su, Liupeng
Mao, Shunfu
Chen, Min
Pan, Guoqiang
Xu, Bo
Zhu, Qing
PHOTOGRAMMETRIC RECORD, 2023, 38 (183): : 430 - 449
[43] Multi-view 3D Reconstruction Based on Deformable Convolution and Laplace Pyramid Residuals
Hao, Zhaoming
Zhang, Ziyang
Li, Hongyan
Xu, Baoqing
Zhang, Xiaoqiong
Xu, Meng
Wang, Weifeng
IAENG International Journal of Computer Science, 2024, 51 (07) : 896 - 905
[44] Deformable 3D Shape Classification Using 3D Racah Moments and Deep Neural Networks
Lakhili, Zouhir
El Alami, Abdelmajid
Mesbah, Abderrahim
Berrahou, Aissam
Qjidaa, Hassan
SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 : 12 - 20
[45] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Liu, Xianpeng
Zheng, Ce
Qian, Ming
Xue, Nan
Chen, Chen
Zhang, Zhebin
Li, Chen
Wu, Tianfu
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
[46] Multi-View 3D Video Delivery for Broadband IP Networks
Ho, Ting-Yu
Yeh, Yi-Nung
Yang, De-Nian
2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 5796 - 5802
[47] Multi-view facial landmark detection by using a 3D shape model
Cech, Jan
Franc, Vojtech
Uricar, Michal
Matas, Jiri
IMAGE AND VISION COMPUTING, 2016, 47 : 60 - 70
[48] Multi-view Shape Generation for a 3D Human-like Body
Yu, Hang
Cheang, Chilam
Fu, Yanwei
Xue, Xiangyang
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
[49] Group Multi-View Transformer for 3D Shape Analysis With Spatial Encoding
Xu, Lixiang
Cui, Qingzhe
Hong, Richang
Xu, Wei
Chen, Enhong
Yuan, Xin
Li, Chenglong
Tang, Yuanyan
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9450 - 9463
[50] Minimum variance estimation of 3D face shape from multi-view
Zhang, Zhenqiu
Hu, Yuxiao
Yu, Tianli
Huang, Thomas
PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 547 - +

← 1 2 3 4 5 →