AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

被引:0
|
作者
Zhang, Huichao [1 ]
Chen, Bowen [1 ]
Yang, Hao [1 ]
Qu, Liao [1 ,2 ]
Wang, Xu [1 ]
Chen, Li [1 ]
Long, Chao [1 ]
Zhu, Feida [1 ]
Du, Daniel [1 ]
Zheng, Min [1 ]
机构
[1] ByteDance, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating expressive, diverse and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of modeling and texturing in 3D that ensure details and various styles (realistic, fictional, etc). We present AvatarVerse, a stable pipeline for generating expressive high-quality 3D avatars from nothing but text descriptions and pose guidance. In specific, we introduce a 2D diffusion model conditioned on DensePose signal to establish 3D pose control of avatars through 2D images, which enhances view consistency from partially observed scenarios. It addresses the infamous Janus Problem and significantly stablizes the generation process. Moreover, we propose a progressive high-resolution 3D synthesis strategy, which obtains substantial improvement over the quality of the created 3D avatars. To this end, the proposed AvatarVerse pipeline achieves zero-shot 3D modeling of 3D avatars that are not only more expressive, but also in higher quality and fidelity than previous works. Rigorous qualitative evaluations and user studies showcase AvatarVerse's superiority in synthesizing high-fidelity 3D avatars, leading to a new standard in high-quality and stable 3D avatar creation. Our project page is: https://avatarverse3d.github.io/
引用
收藏
页码:7124 / 7132
页数:9
相关论文
共 50 条
  • [41] GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting
    Yang, Chen
    Li, Sikuang
    Fang, Jiemin
    Liang, Ruofan
    Xie, Lingxi
    Zhang, Xiaopeng
    Shen, Wei
    Tian, Qi
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (06):
  • [42] A TEXT-TO-SL SYNTHESIS SYSTEM USING 3D AVATAR TECHNOLOGY
    Gibet, Sylvie
    Marteau, Pierre-Francois
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [43] ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
    Zheng, Hongwei
    Li, Han
    Shi, Bowen
    Dai, Wenrui
    Wang, Botao
    Sun, Yu
    Guo, Min
    Xiong, Hongkai
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2657 - 2662
  • [44] A lossy 3D wavelet transform for high-quality compression of medical video
    Bernabe, Gregorio
    Garcia, Jose M.
    Gonzalez, Jose
    JOURNAL OF SYSTEMS AND SOFTWARE, 2009, 82 (03) : 526 - 534
  • [45] Virtual Reality Aided High-Quality 3D Reconstruction by Remote Drones
    Zhang, Di
    Xu, Feng
    Pun, Chi-Man
    Yang, Yang
    Lan, Rushi
    Wang, Liejun
    Li, Yujie
    Gao, Hao
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2022, 22 (01)
  • [46] MeshPaint 3D V1.5 for high-quality texture maps
    Kennedy, S
    COMPUTER GRAPHICS WORLD, 1996, 19 (10) : 70 - &
  • [47] Optimum conditions for high-quality 3D reconstruction in confocal scanning microscopy
    Kim, Taehoon
    Kim, Taejoong
    Lee, SeungWoo
    Gweon, Dae-Gab
    Seo, Jungwoo
    THREE-DIMENSIONAL AND MULTIDIMENSIONAL MICROSCOPY: IMAGE ACQUISITION AND PROCESSING XIII, 2006, 6090
  • [48] Production of high-quality polydisperse construction mixes for additive 3D technologies
    Gerasimov, M. D.
    Brazhnik, Yu V.
    Gorshkov, P. S.
    Latyshev, S. S.
    INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING, AUTOMATION AND CONTROL SYSTEMS 2017, 2018, 327
  • [49] HIGH-QUALITY 3D MODELS AND THEIR USE IN A CULTURAL HERITAGE CONSERVATION PROJECT
    Tucci, G.
    Bonora, V.
    Conti, A.
    Fiorini, L.
    ICOMOS/ISPRS INTERNATIONAL SCIENTIFIC COMMITTEE ON HERITAGE DOCUMENTATION (CIPA) 26TH INTERNATIONAL CIPA SYMPOSIUM - DIGITAL WORKFLOWS FOR HERITAGE CONSERVATION, 2017, 42-2 (W5): : 687 - 693
  • [50] High-quality 3D shape measurement using saturated fringe patterns
    Chen, Bo
    Zhang, Song
    OPTICS AND LASERS IN ENGINEERING, 2016, 87 : 83 - 89