Siamese CNN-BiLSTM Architecture for 3D Shape Representation Learning

被引:0
|
作者
Dai, Guoxian [1 ,2 ,4 ]
Xie, Jin [1 ,2 ]
Fang, Yi [1 ,2 ,3 ]
机构
[1] NYU Abu Dhabi, NYU Multimedia & Visual Comp Lab, Abu Dhabi, U Arab Emirates
[2] NYU Abu Dhabi, Dept ECE, Abu Dhabi, U Arab Emirates
[3] NYU, Tandon Sch Engn, Dept ECE, New York, NY 10003 USA
[4] NYU, Tandon Sch Engn, Dept CSE, New York, NY 10003 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a 3D shape representation from a collection of its rendered 2D images has been extensively studied. However, existing view-based techniques have not yet fully exploited the information among all the views of projections. In this paper, by employing recurrent neural network to efficiently capture features across different views, we propose a siamese CNN-BiLSTM network for 3D shape representation learning. The proposed method minimizes a discriminative loss function to learn a deep nonlinear transformation, mapping 3D shapes from the original space into a nonlinear feature space. In the transformed space, the distance of 3D shapes with the same label is minimized, otherwise the distance is maximized to a large margin. Specifically, the 3D shapes are first projected into a group of 2D images from different views. Then convolutional neural network (CNN) is adopted to extract features from different view images, followed by a bidirectional long short-term memory (LSTM) to aggregate information across different views. Finally, we construct the whole CNN-BiLSTM network into a siamese structure with contrastive loss function. Our proposed method is evaluated on two benchmarks, ModelNet40 and SHREC 2014, demonstrating superiority over the state-of-the-art methods.
引用
收藏
页码:670 / 676
页数:7
相关论文
共 50 条
  • [31] Texture synthesis for 3D shape representation
    Gorla, G
    Interrante, V
    Sapiro, G
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2003, 9 (04) : 512 - 524
  • [32] Medial Axis for 3D Shape Representation
    Qiu, Wei
    Sakai, Ko
    NEURAL INFORMATION PROCESSING, PT I, 2011, 7062 : 79 - +
  • [33] Attention-based aspect sentiment classification using enhanced learning through CNN-BiLSTM networks
    Ayetiran, Eniafe Festus
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [34] A dimensional reduction guiding deep learning architecture for 3D shape retrieval
    Wang, Zihao
    Lin, Hongwei
    Yu, Xiaofeng
    Hamza, Yusuf Fatihu
    COMPUTERS & GRAPHICS-UK, 2019, 81 : 82 - 91
  • [35] MOTION REPRESENTATION USING RESIDUAL FRAMES WITH 3D CNN
    Tao, Li
    Wang, Xueting
    Yamasaki, Toshihiko
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1786 - 1790
  • [36] Earthquake Magnitude Prediction using Spatia-Temporal Features Learning Based on Hybrid CNN-BiLSTM Model
    Kavianpour, Parisa
    Kavianpour, Mohammadreza
    Jahani, Ehsan
    Ramezani, Amin
    Proceedings - 2021 7th International Conference on Signal Processing and Intelligent Systems, ICSPIS 2021, 2021,
  • [37] 3D CNN based Partial 3D Shape Retrieval Focusing on Local Features
    Iwabuchi, Wataru
    Aono, Masaki
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1523 - 1529
  • [38] CNN-BiLSTM: A Novel Deep Learning Model for Near-Real-Time Daily Wildfire Spread Prediction
    Marjani, Mohammad
    Mahdianpari, Masoud
    Mohammadimanesh, Fariba
    REMOTE SENSING, 2024, 16 (08)
  • [39] Advanced AIoT for failure classification of industrial diesel generators based hybrid deep learning CNN-BiLSTM algorithm
    Thanh, Phuong Nguyen
    Cho, Ming-Yuan
    ADVANCED ENGINEERING INFORMATICS, 2024, 62
  • [40] SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation
    Yi, Li
    Su, Hao
    Guo, Xingwen
    Guibas, Leonidas
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6584 - 6592