Granular3D: Delving into multi-granularity 3D scene graph prediction

被引：0

作者：

Huang, Kaixiang ^{[1
,2
]}

Yang, Jingru ^{[1
,2
]}

Wang, Jin ^{[1
,2
,6
]}

He, Shengfeng ^{[3
]}

Wang, Zhan ^{[4
]}

He, Haiyan ^{[1
,2
,5
]}

Zhang, Qifeng

Lu, Guodong ^{[1
,2
]}

机构：

[1] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Zhejiang, Peoples R China

[2] Zhejiang Univ, Robot Inst, Hangzhou 310027, Zhejiang, Peoples R China

[3] Singapore Management Univ, Singapore 178903, Singapore

[4] Zhejiang Energy Digital Technol Co Ltd, Dept Artificial Intelligence & Robot, Hangzhou 310027, Zhejiang, Peoples R China

[5] Zhejiang Baima Lake Lab Co Ltd, Hangzhou 310000, Zhejiang, Peoples R China

[6] Jinhua Key Lab Robot Intelligent Welding Technol, Jinhua 321000, Zhejiang, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 153卷

基金：

中国国家自然科学基金;

关键词：

3D point cloud; 3D semantic scene graph prediction; Multi-granularity; Gather point transformer; LANGUAGE;

D O I：

10.1016/j.patcog.2024.110562

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the significant challenges in 3D Semantic Scene Graph (3DSSG) prediction, essential for understanding complex 3D environments. Traditional approaches, primarily using PointNet and Graph Convolutional Networks, struggle with effectively extracting multi -grained features from intricate 3D scenes, largely due to a focus on global scene processing and single -scale feature extraction. To overcome these limitations, we introduce Granular3D, a novel approach that shifts the focus towards multi -granularity analysis by predicting relation triplets from specific sub -scenes. One key is the Adaptive Instance Enveloping Method (AIEM), which establishes an approximate envelope structure around irregular instances, providing shape -adaptive local point cloud sampling, thereby comprehensively covering the contextual environments of instances. Moreover, Granular3D incorporates a Hierarchical Dual -Stage Network (HDSN), which differentiates and processes features of instances and their pairs at varying scales, leading to a targeted prediction of instance categories and their relationships. To advance the perception of sub -scene in HDSN, we design a Gather Point Transformer structure (GaPT) that enables the combinatorial interaction of local information from multiple point cloud sets, achieving a more comprehensive local contextual feature extraction. Extensive evaluations on the challenging 3DSSG benchmark demonstrate that our methods provide substantial improvements, establishing a new state-of-the-art in 3DSSG prediction, boosting the top -50 triplet accuracy by + 2.8%.

引用

页数：12

共 50 条

[1] Multi-Granularity Interaction for Multi-Person 3D Motion Prediction
Liu, Chenchen
Mu, Yadong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1546 - 1558
[2] Multi-Granularity Thermal Evaluation of 3D MPSoC Architectures
Fourmigue, Alain
Beltrame, Giovanni
Nicolescu, Gabriela
Aboulhamid, El Mostapha
O'Connor, Ian
2011 DESIGN, AUTOMATION & TEST IN EUROPE (DATE), 2011, : 575 - 578
[3] Multi-granularity Prediction for Scene Text Recognition
Wang, Peng
Da, Cheng
Yao, Cong
COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 339 - 355
[4] 3D scene graph prediction from point clouds
Wu F.
Yan F.
Shi W.
Zhou Z.
Virtual Reality and Intelligent Hardware, 2022, 4 (01): : 76 - 88
[5] 3D scene graph prediction from point clouds
Fanfan WU
Feihu YAN
Weimin SHI
Zhong ZHOU
虚拟现实与智能硬件(中英文), 2022, 4 (01) : 76 - 88
[6] Heterogeneous Graph Learning for Scene Graph Prediction in 3D Point Clouds
Ma, Yanni
Liu, Hao
Pei, Yun
Guo, Yulan
COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 274 - 291
[7] Prediction and Generation of 3D Functional Scene Based on Relation Graph
Sun Q.
Hu R.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (09): : 1351 - 1361
[8] Multi-granularity relationship reasoning network for high-fidelity 3D shape reconstruction
Li, Lei
Zhou, Zhiyuan
Wu, Suping
Li, Pan
Zhang, Boyang
PATTERN RECOGNITION, 2024, 155
[9] MULTI-GRANULARITY FEATURE INTERACTION AND RELATION REASONING FOR 3D DENSE ALIGNMENT AND FACE RECONSTRUCTION
Li, Lei
Li, Xiangzheng
Wu, Kangbo
Lin, Kui
Wu, Suping
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4265 - 4269
[10] MLGPnet: Multi-granularity neural network for 3D shape recognition using pyramid data
Li, Zekun
Seah, Hock Soon
Guo, Baolong
Yang, Muli
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239

← 1 2 3 4 5 →