Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis

被引:2
|
作者
Xu, Lixiang [1 ,2 ]
Cui, Qingzhe [1 ]
Xu, Wei [1 ]
Chen, Enhong [2 ]
Tong, He [3 ]
Tang, Yuanyan [4 ]
机构
[1] Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[3] Chinese Peoples Liberat Army Aviat Inst, Dept Basic, Beijing 101123, Peoples R China
[4] FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Macao, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape analysis; Path aggregation; Graph networks; Vision transformer; Multi-view fusion;
D O I
10.1016/j.inffus.2023.102131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The graph-based multi-view methods have achieved state-of-the-art results in 3D shape analysis tasks by taking advantage of graph convolutional networks (GCN) to process discrete data. However, the homogeneity of the traditional GCN aggregation operator leads to a problem in aggregating neighborhood information, i.e., if several views have the same neighbors, the same node embeddings will be generated, resulting in feature redundancy. To address this problem, we propose a Multi-view Path Aggregation Graph Network (MVPNet) for 3D shape analysis, which aims to extract a particular path from a graph composed of multiple views and aggregate it into an effective 3D shape descriptor. Specifically, we first extract a path in the graph through dynamic walking, and update the path status while searching for new nodes during the walking. Then we embed the position information of the nodes in the order of the nodes in the path. Finally, we propose to aggregate the features of a path employing a Path Transformer that is capable of handling ordered sequences. A path contains richer semantic and structural information than a traditional subgraph. To demonstrate the effectiveness of our proposed method, we conduct extensive experiments on three benchmark datasets, namely ModelNet, ShapeNetCore55 and MCB, and these experiments prove that the method outperforms the current methods in 3D shape classification and retrieval tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Multi-view Fusion with Deep Learning for 3D Shape Classification
    Huang, Xiang
    Wang, Mantao
    Zhang, Dejun
    Zhu, Yu
    Zou, Lu
    Sun, Jun
    Han, Fei
    He, Linchao
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 189 - 194
  • [22] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [23] Deformable convolutional networks for multi-view 3D shape classification
    Ma, Pengfei
    Ma, Jie
    Wang, Xujiao
    Yang, Lichuang
    Wang, Nannan
    ELECTRONICS LETTERS, 2018, 54 (24) : 1373 - 1374
  • [24] View-GCN: View-based Graph Convolutional Network for 3D Shape Analysis
    Wei, Xin
    Yu, Ruixuan
    Sun, Jian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1847 - 1856
  • [25] Multi-View Face Alignment Using 3D Shape Model for View Estimation
    Su, Yanchao
    Ai, Haizhou
    Lao, Shihong
    ADVANCES IN BIOMETRICS, 2009, 5558 : 179 - +
  • [26] Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition
    Zeng, Hui
    Zhao, Tianmeng
    Cheng, Ruting
    Wang, Fuzhou
    Liu, Jiwei
    IEEE ACCESS, 2021, 9 (09): : 33323 - 33335
  • [27] Multi-view graph imputation network
    Peng, Xin
    Cheng, Jieren
    Tang, Xiangyan
    Zhang, Bin
    Tu, Wenxuan
    INFORMATION FUSION, 2024, 102
  • [28] MVCLN: Multi-View Convolutional LSTM Network for Cross-Media 3D Shape Recognition
    Liang, Qi
    Wang, Yixin
    Nie, Weizhi
    Li, Qiang
    IEEE ACCESS, 2020, 8 : 139792 - 139802
  • [29] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
    Liu, Xianpeng
    Zheng, Ce
    Qian, Ming
    Xue, Nan
    Chen, Chen
    Zhang, Zhebin
    Li, Chen
    Wu, Tianfu
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
  • [30] Hierarchical Graph Structure Learning for Multi-View 3D Model Retrieval
    Su, Yuting
    Li, Wenhui
    Liu, Anan
    Nie, Weizhi
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 913 - 919