ShapeScaffolder: Structure-Aware 3D Shape Generation from Text

被引：3

作者：

Tian, Xi ^{[1
]}

Yang, Yong-Liang ^{[1
]}

Wu, Qi ^{[2
]}

机构：

[1] Univ Bath, Bath, Avon, England

[2] Univ Adelaide, Adelaide, SA, Australia

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.00256

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present ShapeScaffolder, a structure-based neural network for generating colored 3D shapes based on text input. The approach, similar to providing scaffolds as internal structural supports and adding more details to them, aims to capture finer text-shape connections and improve the quality of generated shapes. Traditional text-to- shape methods often generate 3D shapes as a whole. However, humans tend to understand both shape and text as being structure- based. For example, a table is interpreted as being composed of legs, a seat, and a back; similarly, texts possess inherent linguistic structures that can be analyzed as dependency graphs, depicting the relationships between entities within the text. We believe structure-aware shape generation can bring finer text-shape connections and improve shape generation quality. However, the lack of explicit shape structure and the high freedom of text structure make cross-modality learning challenging. To address these challenges, we first build the structured shape implicit fields in an unsupervised manner. We then propose the part-level attention mechanism between shape parts and textual graph nodes to align the two modalities at the structural level. Finally, we employ a shape refiner to add further detail to the predicted structure, yielding the final results. Extensive experimentation demonstrates that our approaches outperform state-of-the-art methods in terms of both shape fidelity and shape-text matching. Our methods also allow for part-level manipulation and improved part-level completeness.

引用

页码：2715 / 2724

页数：10

共 50 条

[1] Structure-aware shape correspondence network for 3D shape synthesis
Lang, Xufeng
Sun, Zhengxing
COMPUTER AIDED GEOMETRIC DESIGN, 2020, 79
[2] Structure-Aware 3D VR Sketch to 3D Shape Retrieval
Luo, Ling
Gryaditskaya, Yulia
Xiang, Tao
Song, Yi-Zhe
2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 383 - 392
[3] Structure-Aware Procedural Text Generation From an Image Sequence
Nishimura, Taichi
Hashimoto, Atsushi
Ushiku, Yoshitaka
Kameko, Hirotaka
Yamakata, Yoko
Mori, Shinsuke
IEEE ACCESS, 2021, 9 : 2125 - 2141
[4] StruMonoNet: Structure-Aware Monocular 3D Prediction
Yang, Zhenpei
Li, Li Erran
Huang, Qixing
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7409 - 7418
[5] StruMonoNet: Structure-Aware Monocular 3D Prediction
Yang, Zhenpei
Li, Li Erran
Huang, Qixing
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2021, : 7409 - 7418
[6] Physically reliable 3D styled shape generation via structure-aware topology optimization in unified latent space
Ijaz, Haroon
Wang, Xuwei
Chen, Wei
Lin, Hai
Li, Ming
COMPUTER-AIDED DESIGN, 2025, 183
[7] Structure-aware fusion network for 3D scene understanding
Yan, Haibin
Lv, Yating
Liong, Venice Erin
CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (05) : 194 - 203
[8] Structure-aware fusion network for 3D scene understanding
Haibin YAN
Yating LV
Venice Erin LIONG
Chinese Journal of Aeronautics, 2022, 35 (05) : 194 - 203
[9] Structure-aware fusion network for 3D scene understanding
YAN, Haibin
LV, Yating
LIONG, Venice Erin
Chinese Journal of Aeronautics, 2022, 35 (05): : 194 - 203
[10] Structure-aware fusion network for 3D scene understanding
Haibin YAN
Yating LV
Venice Erin LIONG
Chinese Journal of Aeronautics , 2022, (05) : 194 - 203

← 1 2 3 4 5 →