ShapeScaffolder: Structure-Aware 3D Shape Generation from Text

Cited by: 3
Authors
Tian, Xi [1]
Yang, Yong-Liang [1]
Wu, Qi [2]
Affiliations
[1] Univ Bath, Bath, Avon, England
[2] Univ Adelaide, Adelaide, SA, Australia
DOI
10.1109/ICCV51070.2023.00256
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present ShapeScaffolder, a structure-based neural network for generating colored 3D shapes based on text input. The approach, similar to providing scaffolds as internal structural supports and adding more details to them, aims to capture finer text-shape connections and improve the quality of generated shapes. Traditional text-to-shape methods often generate 3D shapes as a whole. However, humans tend to understand both shape and text as being structure-based. For example, a table is interpreted as being composed of legs, a seat, and a back; similarly, texts possess inherent linguistic structures that can be analyzed as dependency graphs, depicting the relationships between entities within the text. We believe structure-aware shape generation can bring finer text-shape connections and improve shape generation quality. However, the lack of explicit shape structure and the high freedom of text structure make cross-modality learning challenging. To address these challenges, we first build the structured shape implicit fields in an unsupervised manner. We then propose the part-level attention mechanism between shape parts and textual graph nodes to align the two modalities at the structural level. Finally, we employ a shape refiner to add further detail to the predicted structure, yielding the final results. Extensive experimentation demonstrates that our approaches outperform state-of-the-art methods in terms of both shape fidelity and shape-text matching. Our methods also allow for part-level manipulation and improved part-level completeness.
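The abstract does not specify how the part-level attention between shape parts and textual graph nodes is computed. As a rough illustration only, the sketch below uses generic scaled dot-product cross-attention, where each shape-part feature acts as a query over text-graph node features. This is a swapped-in standard technique, not the authors' implementation; the function name `part_level_attention`, the feature dimensions, and the toy inputs are all assumptions made for the example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def part_level_attention(part_feats, node_feats):
    """Hypothetical sketch: each shape part attends over text-graph nodes.

    part_feats: list of P feature vectors (queries), one per shape part.
    node_feats: list of N feature vectors (keys and values), one per
                dependency-graph node. Both use the same dimension d.
    Returns (weights, attended): a P x N attention matrix and P
    text-conditioned part features.
    """
    d = len(node_feats[0])
    scale = 1.0 / math.sqrt(d)
    weights, attended = [], []
    for query in part_feats:
        # Scaled dot product between this part and every text node.
        scores = [scale * sum(q * k for q, k in zip(query, key))
                  for key in node_feats]
        row = softmax(scores)
        weights.append(row)
        # Weighted sum of node features gives the part's text context.
        attended.append([sum(row[j] * node_feats[j][c]
                             for j in range(len(node_feats)))
                         for c in range(d)])
    return weights, attended


# Toy inputs (assumed): two parts, three text nodes, 2-D features.
parts = [[1.0, 0.0], [0.0, 1.0]]
nodes = [[2.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
weights, attended = part_level_attention(parts, nodes)
```

In this toy setup, the first part places its largest weight on the first node and the second part on the second node, which is the kind of structural alignment between parts and text entities that the abstract describes at a high level.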
Pages: 2715-2724
Page count: 10