ShapeScaffolder: Structure-Aware 3D Shape Generation from Text

被引:3
|
作者
Tian, Xi [1 ]
Yang, Yong-Liang [1 ]
Wu, Qi [2 ]
机构
[1] Univ Bath, Bath, Avon, England
[2] Univ Adelaide, Adelaide, SA, Australia
关键词
D O I
10.1109/ICCV51070.2023.00256
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present ShapeScaffolder, a structure-based neural network for generating colored 3D shapes based on text input. The approach, similar to providing scaffolds as internal structural supports and adding more details to them, aims to capture finer text-shape connections and improve the quality of generated shapes. Traditional text-to- shape methods often generate 3D shapes as a whole. However, humans tend to understand both shape and text as being structure- based. For example, a table is interpreted as being composed of legs, a seat, and a back; similarly, texts possess inherent linguistic structures that can be analyzed as dependency graphs, depicting the relationships between entities within the text. We believe structure-aware shape generation can bring finer text-shape connections and improve shape generation quality. However, the lack of explicit shape structure and the high freedom of text structure make cross-modality learning challenging. To address these challenges, we first build the structured shape implicit fields in an unsupervised manner. We then propose the part-level attention mechanism between shape parts and textual graph nodes to align the two modalities at the structural level. Finally, we employ a shape refiner to add further detail to the predicted structure, yielding the final results. Extensive experimentation demonstrates that our approaches outperform state-of-the-art methods in terms of both shape fidelity and shape-text matching. Our methods also allow for part-level manipulation and improved part-level completeness.
引用
收藏
页码:2715 / 2724
页数:10
相关论文
共 50 条
  • [21] Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning
    Wang, Fei
    Xu, Zhewei
    Szekely, Pedro
    Chen, Muhao
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5037 - 5048
  • [22] Design representation for performance evaluation of 3D shapes in structure-aware generative design
    Li, Xingang
    Xie, Charles
    Sha, Zhenghui
    DESIGN SCIENCE, 2023, 9
  • [23] Table-to-Text Generation by Structure-Aware Seq2seq Learning
    Liu, Tianyu
    Wang, Kexiang
    Sha, Lei
    Chang, Baobao
    Sui, Zhifang
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4881 - 4888
  • [24] Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation
    Ling, Jingwang
    Wang, Zhibo
    Lu, Ming
    Wang, Quan
    Qian, Chen
    Xu, Feng
    COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 249 - 267
  • [25] Structure-aware Knowledge Graph-to-Text Generation with Planning Selection and Similarity Distinction
    Zhao, Feng
    Zou, Hongzhi
    Yan, Cheng
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 8693 - 8703
  • [26] Towards Implicit Text-Guided 3D Shape Generation
    Liu, Zhengzhe
    Wang, Yi
    Qi, Xiaojuan
    Fu, Chi-Wing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17875 - 17885
  • [27] StructCoder: Structure-Aware Transformer for Code Generation
    Tipirneni, Sindhu
    Zhu, Ming
    Reddy, Chandan K.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
  • [28] SEAG: Structure-Aware Event Causality Generation
    Tao, Zhengwei
    Jin, Zhi
    Bai, Xiaoying
    Zhao, Haiyan
    Dou, Chengfeng
    Zhao, Yongqiang
    Wang, Fang
    Tao, Chongyang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 4631 - 4644
  • [29] Response Generation via Structure-Aware Constraints
    Guan, Mengyu
    Wang, Zhongqing
    Zhou, Guodong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (06)
  • [30] Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation from 2D Supervision
    Zhang, Xiaoshuai
    Kundu, Abhijit
    Funkhouser, Thomas
    Guibas, Leonidas
    Su, Hao
    Genova, Kyle
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8274 - 8284