Siamese CNN-BiLSTM Architecture for 3D Shape Representation Learning

被引:0
|
作者
Dai, Guoxian [1 ,2 ,4 ]
Xie, Jin [1 ,2 ]
Fang, Yi [1 ,2 ,3 ]
机构
[1] NYU Abu Dhabi, NYU Multimedia & Visual Comp Lab, Abu Dhabi, U Arab Emirates
[2] NYU Abu Dhabi, Dept ECE, Abu Dhabi, U Arab Emirates
[3] NYU, Tandon Sch Engn, Dept ECE, New York, NY 10003 USA
[4] NYU, Tandon Sch Engn, Dept CSE, New York, NY 10003 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a 3D shape representation from a collection of its rendered 2D images has been extensively studied. However, existing view-based techniques have not yet fully exploited the information among all the views of projections. In this paper, by employing recurrent neural network to efficiently capture features across different views, we propose a siamese CNN-BiLSTM network for 3D shape representation learning. The proposed method minimizes a discriminative loss function to learn a deep nonlinear transformation, mapping 3D shapes from the original space into a nonlinear feature space. In the transformed space, the distance of 3D shapes with the same label is minimized, otherwise the distance is maximized to a large margin. Specifically, the 3D shapes are first projected into a group of 2D images from different views. Then convolutional neural network (CNN) is adopted to extract features from different view images, followed by a bidirectional long short-term memory (LSTM) to aggregate information across different views. Finally, we construct the whole CNN-BiLSTM network into a siamese structure with contrastive loss function. Our proposed method is evaluated on two benchmarks, ModelNet40 and SHREC 2014, demonstrating superiority over the state-of-the-art methods.
引用
收藏
页码:670 / 676
页数:7
相关论文
共 50 条
  • [41] Multi-Scale Representation Learning on Hypergraph for 3D Shape Retrieval and Recognition
    Bai, Junjie
    Gong, Biao
    Zhao, Yining
    Lei, Fuqiang
    Yan, Chenggang
    Gao, Yue
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 5327 - 5338
  • [42] Learning Robust Point Representation for 3D Non-Rigid Shape Retrieval
    Wu, Hao
    Fang, Lincong
    Yu, Qian
    Yang, Chengzhuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4430 - 4444
  • [43] Ionospheric TEC Prediction in China during Storm Periods Based on Deep Learning: Mixed CNN-BiLSTM Method
    Ren, Xiaochen
    Zhao, Biqiang
    Ren, Zhipeng
    Xiong, Bo
    REMOTE SENSING, 2024, 16 (17)
  • [44] Hybrid Deep Learning Models for Tennis Action Recognition: Enhancing Professional Training Through CNN-BiLSTM Integration
    Chen, Zhaokun
    Xie, Qin
    Jiang, Wei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2025, 37 (6-8):
  • [45] Activity Representation Using 3D Shape Models
    Abdelkader, Mohamed F.
    Roy-Chowdhury, Amit K.
    Chellappa, Rama
    Akdemir, Umut
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2008, 2008 (1)
  • [46] Sparse Representation for Robust 3D Shape Matching
    Tu, Hong
    Geng, Guohua
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE, 2014, 101 : 1005 - 1009
  • [47] Sampled medial loci for 3D shape representation
    Stolpner, Svetlana
    Whitesides, Sue
    Siddiqi, Kaleem
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2011, 115 (05) : 695 - 706
  • [48] H-CNN: Spatial Hashing Based CNN for 3D Shape Analysis
    Shao, Tianjia
    Yang, Yin
    Weng, Yanlin
    Hou, Qiming
    Zhou, Kun
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (07) : 2403 - 2416
  • [49] 3D shape representation: Transforming polygons into voxels
    Oomes, S
    Snoeren, P
    Dijkstra, T
    SCALE-SPACE THEORY IN COMPUTER VISION, 1997, 1252 : 349 - 352
  • [50] ADAPTIVE GRAPH FORMULATION FOR 3D SHAPE REPRESENTATION
    Alwaely, Basheer
    Abhayaratne, Charith
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1947 - 1951