Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis

被引:2
|
作者
Xu, Lixiang [1 ,2 ]
Cui, Qingzhe [1 ]
Xu, Wei [1 ]
Chen, Enhong [2 ]
Tong, He [3 ]
Tang, Yuanyan [4 ]
机构
[1] Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[3] Chinese Peoples Liberat Army Aviat Inst, Dept Basic, Beijing 101123, Peoples R China
[4] FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Macao, Peoples R China
基金
中国国家自然科学基金;
关键词
3D shape analysis; Path aggregation; Graph networks; Vision transformer; Multi-view fusion;
D O I
10.1016/j.inffus.2023.102131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The graph-based multi-view methods have achieved state-of-the-art results in 3D shape analysis tasks by taking advantage of graph convolutional networks (GCN) to process discrete data. However, the homogeneity of the traditional GCN aggregation operator leads to a problem in aggregating neighborhood information, i.e., if several views have the same neighbors, the same node embeddings will be generated, resulting in feature redundancy. To address this problem, we propose a Multi-view Path Aggregation Graph Network (MVPNet) for 3D shape analysis, which aims to extract a particular path from a graph composed of multiple views and aggregate it into an effective 3D shape descriptor. Specifically, we first extract a path in the graph through dynamic walking, and update the path status while searching for new nodes during the walking. Then we embed the position information of the nodes in the order of the nodes in the path. Finally, we propose to aggregate the features of a path employing a Path Transformer that is capable of handling ordered sequences. A path contains richer semantic and structural information than a traditional subgraph. To demonstrate the effectiveness of our proposed method, we conduct extensive experiments on three benchmark datasets, namely ModelNet, ShapeNetCore55 and MCB, and these experiments prove that the method outperforms the current methods in 3D shape classification and retrieval tasks.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Multi-view depth estimation based on multi-feature aggregation for 3D reconstruction
    Zhang, Chi
    Liang, Lingyu
    Zhou, Jijun
    Xu, Yong
    COMPUTERS & GRAPHICS-UK, 2024, 122
  • [42] VSFormer: Mining Correlations in Flexible View Set for Multi-View 3D Shape Understanding
    Sun, Hongyu
    Wang, Yongcai
    Wang, Peng
    Deng, Haoran
    Cai, Xudong
    Li, Deying
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (04) : 2127 - 2141
  • [43] A Multi-View Nonlinear Active Shape Model Based on 3D transformation Shape Search
    Yi Faling
    Xiong Wei
    Huang Zhanpeng
    Zhao Jie
    FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 2, PROCEEDINGS, 2009, : 15 - 18
  • [44] ReINView: Re-interpreting Views for Multi-view 3D Object Recognition
    Xu, Ruchang
    Ma, Wei
    Mi, Qing
    Zha, Hongbin
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 6630 - 6636
  • [45] 3D Reconstruction for Multi-view Objects
    Yu, Jun
    Yin, Wenbin
    Hu, Zhiyi
    Liu, Yabin
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
  • [46] Multi-view 3D Reconstruction with Transformers
    Wang, Dan
    Cui, Xinrui
    Chen, Xun
    Zou, Zhengxia
    Shi, Tianyang
    Salcudean, Septimiu
    Wang, Z. Jane
    Ward, Rabab
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
  • [47] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
    Gurbuz, Sabri
    Yano, Sumio
    Iwasawa, Shoichiro
    Ando, Hiroshi
    IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
  • [48] MV-LFN: Multi-view based local information fusion network for 3D shape recognition
    Zhang, Jing
    Zhou, Dangdang
    Zhao, Yue
    Nie, Weizhi
    Su, Yuting
    VISUAL INFORMATICS, 2021, 5 (03) : 114 - 119
  • [49] Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction
    Yang, Bo
    Wang, Sen
    Markham, Andrew
    Trigoni, Niki
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (01) : 53 - 73
  • [50] Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction
    Bo Yang
    Sen Wang
    Andrew Markham
    Niki Trigoni
    International Journal of Computer Vision, 2020, 128 : 53 - 73