T2TD: Text-3D Generation Model Based on Prior Knowledge Guidance

Authors
Nie, Weizhi [1]
Chen, Ruidong [1]
Wang, Weijie [2]
Lepri, Bruno [3]
Sebe, Nicu [2]
Affiliations
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300384, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy
[3] Fdn Bruno Kessler, I-38122 Trento, Italy
Funding
National Natural Science Foundation of China;
Keywords
Three-dimensional displays; Solid modeling; Shape; Data models; Knowledge graphs; Legged locomotion; Natural languages; 3D model generation; causal model inference; cross-modal representation; knowledge graph; natural language;
DOI
10.1109/TPAMI.2024.3463753
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
In recent years, 3D models have been used in many applications, such as autonomous driving, 3D reconstruction, VR, and AR. However, available 3D model data are too scarce to meet practical demand. Generating high-quality 3D models efficiently from textual descriptions is therefore a promising but challenging way to address this problem. In this paper, inspired by the creative mechanisms of human imagination, which draw on experiential knowledge to fill in a concrete target model from an ambiguous description, we propose a novel text-3D generation model (T2TD). T2TD aims to generate the target model from the textual description with the aid of experiential knowledge, and its creation process simulates the imaginative mechanisms of human beings. In this process, we first introduce a text-3D knowledge graph that preserves the relationships between 3D models and textual semantic information and provides related shapes, analogous to a human's experiential information. Second, we propose a causal inference model to select useful feature information from these related shapes; it removes unrelated structural information and retains only the feature information strongly related to the textual description. Third, we adopt a novel multi-layer transformer structure to progressively fuse this strongly related structural information with the textual information, compensating for the lack of structural information and enhancing the final performance of the 3D generation model. The experimental results demonstrate that our approach significantly improves 3D model generation quality and outperforms state-of-the-art (SOTA) methods on the Text2Shape datasets.
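The three steps named in the abstract (retrieve related shapes from a knowledge graph, select the relevant ones, fuse them with the text) can be illustrated with a toy sketch. This is not the authors' implementation: the graph contents, feature vectors, and the function names `retrieve`, `select`, and `fuse` are all hypothetical, a cosine-similarity threshold stands in for the causal inference model, and a weighted average stands in for the multi-layer transformer.

```python
import math

# Toy stand-in for a text-3D knowledge graph: each shape id is linked to
# descriptive words and a small feature vector. All entries are invented.
GRAPH = {
    "chair_01": {"words": {"chair", "wooden", "legs"}, "feat": [0.9, 0.1, 0.0]},
    "chair_02": {"words": {"chair", "round", "swivel"}, "feat": [0.7, 0.3, 0.1]},
    "table_01": {"words": {"table", "wooden", "square"}, "feat": [0.1, 0.9, 0.2]},
}

def retrieve(query_words, graph):
    """Step 1: return ids of shapes sharing at least one word with the query,
    ordered by descending word overlap, then by id."""
    scored = [(len(query_words & node["words"]), sid) for sid, node in graph.items()]
    hits = [(n, sid) for n, sid in scored if n > 0]
    return [sid for n, sid in sorted(hits, key=lambda t: (-t[0], t[1]))]

def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def select(text_feat, shape_ids, graph, threshold=0.5):
    """Step 2 (stand-in): keep shapes whose features align with the text.
    A similarity threshold replaces the paper's causal inference model."""
    return [sid for sid in shape_ids
            if cosine(text_feat, graph[sid]["feat"]) >= threshold]

def fuse(text_feat, prior_feats, alpha=0.5):
    """Step 3 (stand-in): blend text features with the mean prior feature.
    A weighted average replaces the paper's multi-layer transformer."""
    if not prior_feats:
        return list(text_feat)
    mean = [sum(col) / len(prior_feats) for col in zip(*prior_feats)]
    return [alpha * t + (1 - alpha) * m for t, m in zip(text_feat, mean)]

text_feat = [1.0, 0.0, 0.0]  # hypothetical embedding of "a wooden chair"
candidates = retrieve({"wooden", "chair"}, GRAPH)
kept = select(text_feat, candidates, GRAPH)
fused = fuse(text_feat, [GRAPH[sid]["feat"] for sid in kept])
print(candidates, kept, fused)
```

In this toy run the table is retrieved because it shares the word "wooden" with the query, and the selection step then discards it because its features do not align with the text, which mirrors the abstract's point about removing unrelated structural information before fusion.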
Pages: 172-189
Page count: 18