Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis

被引：2

作者：

Xu, Lixiang ^{[1
,2
]}

Cui, Qingzhe ^{[1
]}

Xu, Wei ^{[1
]}

Chen, Enhong ^{[2
]}

Tong, He ^{[3
]}

Tang, Yuanyan ^{[4
]}

机构：

[1] Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Anhui, Peoples R China

[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China

[3] Chinese Peoples Liberat Army Aviat Inst, Dept Basic, Beijing 101123, Peoples R China

[4] FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Macao, Peoples R China

来源：

INFORMATION FUSION | 2024年 / 103卷

基金：

中国国家自然科学基金;

关键词：

3D shape analysis; Path aggregation; Graph networks; Vision transformer; Multi-view fusion;

D O I：

10.1016/j.inffus.2023.102131

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The graph-based multi-view methods have achieved state-of-the-art results in 3D shape analysis tasks by taking advantage of graph convolutional networks (GCN) to process discrete data. However, the homogeneity of the traditional GCN aggregation operator leads to a problem in aggregating neighborhood information, i.e., if several views have the same neighbors, the same node embeddings will be generated, resulting in feature redundancy. To address this problem, we propose a Multi-view Path Aggregation Graph Network (MVPNet) for 3D shape analysis, which aims to extract a particular path from a graph composed of multiple views and aggregate it into an effective 3D shape descriptor. Specifically, we first extract a path in the graph through dynamic walking, and update the path status while searching for new nodes during the walking. Then we embed the position information of the nodes in the order of the nodes in the path. Finally, we propose to aggregate the features of a path employing a Path Transformer that is capable of handling ordered sequences. A path contains richer semantic and structural information than a traditional subgraph. To demonstrate the effectiveness of our proposed method, we conduct extensive experiments on three benchmark datasets, namely ModelNet, ShapeNetCore55 and MCB, and these experiments prove that the method outperforms the current methods in 3D shape classification and retrieval tasks.

引用

页数：13

共 50 条

[41] Multi-view depth estimation based on multi-feature aggregation for 3D reconstruction
Zhang, Chi
Liang, Lingyu
Zhou, Jijun
Xu, Yong
COMPUTERS & GRAPHICS-UK, 2024, 122
[42] VSFormer: Mining Correlations in Flexible View Set for Multi-View 3D Shape Understanding
Sun, Hongyu
Wang, Yongcai
Wang, Peng
Deng, Haoran
Cai, Xudong
Li, Deying
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (04) : 2127 - 2141
[43] A Multi-View Nonlinear Active Shape Model Based on 3D transformation Shape Search
Yi Faling
Xiong Wei
Huang Zhanpeng
Zhao Jie
FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 2, PROCEEDINGS, 2009, : 15 - 18
[44] ReINView: Re-interpreting Views for Multi-view 3D Object Recognition
Xu, Ruchang
Ma, Wei
Mi, Qing
Zha, Hongbin
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 6630 - 6636
[45] 3D Reconstruction for Multi-view Objects
Yu, Jun
Yin, Wenbin
Hu, Zhiyi
Liu, Yabin
COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
[46] Multi-view 3D Reconstruction with Transformers
Wang, Dan
Cui, Xinrui
Chen, Xun
Zou, Zhengxia
Shi, Tianyang
Salcudean, Septimiu
Wang, Z. Jane
Ward, Rabab
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5702 - 5711
[47] Multi-View Image Capture for Glasses Free Multi-View 3D Displays
Gurbuz, Sabri
Yano, Sumio
Iwasawa, Shoichiro
Ando, Hiroshi
IDW'10: PROCEEDINGS OF THE 17TH INTERNATIONAL DISPLAY WORKSHOPS, VOLS 1-3, 2010, : 2091 - 2094
[48] MV-LFN: Multi-view based local information fusion network for 3D shape recognition
Zhang, Jing
Zhou, Dangdang
Zhao, Yue
Nie, Weizhi
Su, Yuting
VISUAL INFORMATICS, 2021, 5 (03) : 114 - 119
[49] Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction
Yang, Bo
Wang, Sen
Markham, Andrew
Trigoni, Niki
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (01) : 53 - 73
[50] Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction
Bo Yang
Sen Wang
Andrew Markham
Niki Trigoni
International Journal of Computer Vision, 2020, 128 : 53 - 73

← 1 2 3 4 5 →