3D model recognition and segmentation based on multi-feature fusion

被引:0
|
作者
Dang J. [1 ]
Yang J. [1 ]
机构
[1] School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou
来源
Yang, Jun (yangj@mail.lzjtu.cn) | 1600年 / Science Press卷 / 47期
关键词
3D point cloud; Attention fusion; Convolutional neural networks; Deep learning; Object recognition; Semantic segmentation;
D O I
10.19665/j.issn1001-2400.2020.04.020
中图分类号
学科分类号
摘要
Current methods focusing on 3D model recognition and segmentation have to some extent ignored the relationship between the high-level global single-point features and the low-level local geometric features of those models, resulting in poor recognition results. A multi-feature fusion approach which takes into consideration the aforementioned ignored relationship is proposed. First, a global single-point network is established to extract the global single-point features with high-level semantic recognition ability by increasing both the width of convolution kernel and the depth of the network. Second, an attentional fusion layer is constructed to learn the implicit relationship between global single-point features and local geometric features to fully explore the fine-grained geometric features that can better represent model categories. Finally, the global single-point features and fine-grained geometric features are further fused to achieve the complementation of advantages and enhance the feature richness. Experimental verification is carried out on the 3D model recognition datasets ModelNet40, ModelNet10 and segmentation datasets ShapeNet Parts, S3DIS, vKITTI, respectively, and comparison with current mainstream recognition algorithms shows that the proposed algorithm not only has higher recognition and segmentation accuracy, but also has stronger robustness. © 2020, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:149 / 157
页数:8
相关论文
共 26 条
  • [1] KRIZHEVSKY A, SUTSKEVER I, HINTON G E., Imagenet Classification with Deep Convolutional Neural Networks, Proceedings of the 2012 25th International Conference on Neural Information Processing Systems: 1, pp. 1097-1105, (2012)
  • [2] ZHANG Jingang, FANG Yuan, YUAN Hao, Et al., Multiple Convolutional Neural Networks for Facial Expression Sequence Recognition, Journal of Xidian University, 45, 1, pp. 150-155, (2018)
  • [3] ZHU Suya, Du Jianchao, LI Yunsong, Et al., Method for Bridge Crack Detection Based on the U-Net Convolutional Network, Journal of Xidian University, 6, 4, pp. 35-42, (2019)
  • [4] LIU Daohua, CUI Yushuang, ZHAO Yansong, Et al., Method for Retrieving the Teaching Image Based on the Improved Convolutional Neural Networks, Journal of Xidian University, 3, pp. 52-58, (2019)
  • [5] WU Z, SONG S, KHOSLA A, Et al., 3D Shapenets: a Deep Representation for Volumetric Shapes, Proceedings of the 2015 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1912-1920, (2015)
  • [6] MATURANA D, SCHERER S., Voxnet: A 3D Convolutional Neural Network for Real-time Object Recognition, Proceedings of the 2015 IEEE International Conference on Intelligent Robots and Systems, pp. 922-928, (2015)
  • [7] YANG Jun, WANG Yimin, 3D Model Recognition and Classification Based on Deep Convolution Neural Network, Journal of Chongqing University of Posts and Telecommunications, 31, 2, pp. 253-260, (2019)
  • [8] YANG Jun, WANG Shun, ZHOU Peng, 3D Model Recognition and Classification Based on Deep Voxel Convolution Neural Network, Acta Optica Sinica, 39, 4, pp. 314-324, (2019)
  • [9] RIEGLER G, ULUSOY A O, GEIGER A., Octnet: Learning Deep 3D Representations at High Resolutions, Proceedings of the 2017 30th IEEE Conference on Computer Vision and Pattern Recognition, pp. 6620-6629, (2017)
  • [10] SU H, MAJI S, KALOGERAKIS E, Et al., Multi-view Convolutional Neural Networks for 3D Shape Recognition, Proceedings of the 2015 IEEE International Conference on Computer Vision, pp. 945-953, (2015)