Walk in Views: Multi-view Path Aggregation Graph Network for 3D Shape Analysis

被引：2

作者：

Xu, Lixiang ^{[1
,2
]}

Cui, Qingzhe ^{[1
]}

Xu, Wei ^{[1
]}

Chen, Enhong ^{[2
]}

Tong, He ^{[3
]}

Tang, Yuanyan ^{[4
]}

机构：

[1] Hefei Univ, Coll Artificial Intelligence & Big Data, Hefei 230027, Anhui, Peoples R China

[2] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China

[3] Chinese Peoples Liberat Army Aviat Inst, Dept Basic, Beijing 101123, Peoples R China

[4] FST Univ Macau, Zhuhai UM Sci & Technol Res Inst, Macau 999078, Macao, Peoples R China

来源：

INFORMATION FUSION | 2024年 / 103卷

基金：

中国国家自然科学基金;

关键词：

3D shape analysis; Path aggregation; Graph networks; Vision transformer; Multi-view fusion;

D O I：

10.1016/j.inffus.2023.102131

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The graph-based multi-view methods have achieved state-of-the-art results in 3D shape analysis tasks by taking advantage of graph convolutional networks (GCN) to process discrete data. However, the homogeneity of the traditional GCN aggregation operator leads to a problem in aggregating neighborhood information, i.e., if several views have the same neighbors, the same node embeddings will be generated, resulting in feature redundancy. To address this problem, we propose a Multi-view Path Aggregation Graph Network (MVPNet) for 3D shape analysis, which aims to extract a particular path from a graph composed of multiple views and aggregate it into an effective 3D shape descriptor. Specifically, we first extract a path in the graph through dynamic walking, and update the path status while searching for new nodes during the walking. Then we embed the position information of the nodes in the order of the nodes in the path. Finally, we propose to aggregate the features of a path employing a Path Transformer that is capable of handling ordered sequences. A path contains richer semantic and structural information than a traditional subgraph. To demonstrate the effectiveness of our proposed method, we conduct extensive experiments on three benchmark datasets, namely ModelNet, ShapeNetCore55 and MCB, and these experiments prove that the method outperforms the current methods in 3D shape classification and retrieval tasks.

引用

页数：13

共 50 条

[21] Multi-view Fusion with Deep Learning for 3D Shape Classification
Huang, Xiang
Wang, Mantao
Zhang, Dejun
Zhu, Yu
Zou, Lu
Sun, Jun
Han, Fei
He, Linchao
2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 189 - 194
[22] Multi-view Convolutional Neural Networks for 3D Shape Recognition
Su, Hang
Maji, Subhransu
Kalogerakis, Evangelos
Learned-Miller, Erik
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
[23] Deformable convolutional networks for multi-view 3D shape classification
Ma, Pengfei
Ma, Jie
Wang, Xujiao
Yang, Lichuang
Wang, Nannan
ELECTRONICS LETTERS, 2018, 54 (24) : 1373 - 1374
[24] View-GCN: View-based Graph Convolutional Network for 3D Shape Analysis
Wei, Xin
Yu, Ruixuan
Sun, Jian
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1847 - 1856
[25] Multi-View Face Alignment Using 3D Shape Model for View Estimation
Su, Yanchao
Ai, Haizhou
Lao, Shihong
ADVANCES IN BIOMETRICS, 2009, 5558 : 179 - +
[26] Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition
Zeng, Hui
Zhao, Tianmeng
Cheng, Ruting
Wang, Fuzhou
Liu, Jiwei
IEEE ACCESS, 2021, 9 (09): : 33323 - 33335
[27] Multi-view graph imputation network
Peng, Xin
Cheng, Jieren
Tang, Xiangyan
Zhang, Bin
Tu, Wenxuan
INFORMATION FUSION, 2024, 102
[28] MVCLN: Multi-View Convolutional LSTM Network for Cross-Media 3D Shape Recognition
Liang, Qi
Wang, Yixin
Nie, Weizhi
Li, Qiang
IEEE ACCESS, 2020, 8 : 139792 - 139802
[29] Multi-View Attentive Contextualization for Multi-View 3D Object Detection
Liu, Xianpeng
Zheng, Ce
Qian, Ming
Xue, Nan
Chen, Chen
Zhang, Zhebin
Li, Chen
Wu, Tianfu
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16688 - 16698
[30] Hierarchical Graph Structure Learning for Multi-View 3D Model Retrieval
Su, Yuting
Li, Wenhui
Liu, Anan
Nie, Weizhi
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 913 - 919

← 1 2 3 4 5 →