ZS3D-Net: Zero-Shot Classification Network for 3D Models

被引:0
|
作者
Bai J. [1 ,2 ]
Yuan T. [1 ]
Fan Y. [1 ]
机构
[1] School of Computer Science and Engineering, North Minzu University, Yinchuan
[2] Key Laboratory of Images, Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan
关键词
3D model classification; deep learning; semantic manifold embedding; zero-shot learning;
D O I
10.3724/SP.J.1089.2022.19173
中图分类号
学科分类号
摘要
Zero-shot 3D model classification is very important for the understanding and analysis of 3D models. Aiming at the problems of lack of corresponding datasets and low accuracy of zero-shot 3D model classification, a 3D model dataset ZS3D is constructed and a deep learning network ZS3D-Net is proposed. The dataset consists of 41 classes, 1677 non-rigid 3D models with complete attributes of all classes, which can be regarded as the benchmark for zero-shot 3D model classification task. For the network, firstly, the visual features of the 3D models are effectively extracted through an ensemble learning sub-network. Then, the correlation between the visual features and semantic features of the unseen and seen classes can be constructed by a semantic manifold embedding sub-network. Finally, the unseen classes can be recognized based on above two sub-networks. On a traditional 3D model dataset and the proposed ZS3D, ZS3D-Net achieves 30.0% and 58.6% classification accuracy respectively, which are on par or better than the state-of-the-art methods. The experiments also demonstrate that the proposed method has good feasibility and validity. © 2022 Institute of Computing Technology. All rights reserved.
引用
收藏
页码:1118 / 1126
页数:8
相关论文
共 28 条
  • [11] Wu Z R, Song S R, Khosla A, Et al., 3D ShapeNets: a deep representation for volumetric shapes, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1912-1920, (2015)
  • [12] Maturana D, Scherer S., VoxNet: a 3D convolutional neural network for real-time object recognition, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 922-928, (2015)
  • [13] Brock A, Lim T, Ritchie J M, Et al., Generative and discriminative voxel modeling with convolutional neural networks
  • [14] Qi C R, Hao S, Mo K C, Et al., PointNet: deep learning on point sets for 3D classification and segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 77-85, (2017)
  • [15] Qi C R, Li Y, Hao S, Et al., PointNet++: deep hierarchical feature learning on pointsets in a metric space, Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 5105-5114, (2017)
  • [16] Li Y Y, Bu R, Sun M C, Et al., PointCNN: convolution on X-transformed points, Proceedings of the 32nd Inter-national Conference on Neural Information Processing Systems, pp. 828-838, (2018)
  • [17] Bai Jing, Si Qinglong, Qin Feiwei, Lightweight real-time point cloud classification network LightPointNet, Journal of Computer-Aided Design & Computer Graphics, 31, 4, pp. 612-621, (2019)
  • [18] Bai Jing, Xu Haojun, MSP-Net: multi-scale point cloud classification network, Journal of Computer-Aided Design & Computer Graphics, 31, 11, pp. 1917-1924, (2019)
  • [19] Su H, Maji S, Kalogerakis E, Et al., Multi-view convolutional neural networks for 3D shape recognition, Proceedings of the IEEE International Conference on Computer Vision, pp. 945-953, (2015)
  • [20] Feng Y F, Zhang Z Z, Zhao X B, Et al., GVCNN: group-view convolutional neural networks for 3D shape recognition, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 264-272, (2018)