AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

被引:0
|
作者
Zhang, Huichao [1 ]
Chen, Bowen [1 ]
Yang, Hao [1 ]
Qu, Liao [1 ,2 ]
Wang, Xu [1 ]
Chen, Li [1 ]
Long, Chao [1 ]
Zhu, Feida [1 ]
Du, Daniel [1 ]
Zheng, Min [1 ]
机构
[1] ByteDance, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating expressive, diverse and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of modeling and texturing in 3D that ensure details and various styles (realistic, fictional, etc). We present AvatarVerse, a stable pipeline for generating expressive high-quality 3D avatars from nothing but text descriptions and pose guidance. In specific, we introduce a 2D diffusion model conditioned on DensePose signal to establish 3D pose control of avatars through 2D images, which enhances view consistency from partially observed scenarios. It addresses the infamous Janus Problem and significantly stablizes the generation process. Moreover, we propose a progressive high-resolution 3D synthesis strategy, which obtains substantial improvement over the quality of the created 3D avatars. To this end, the proposed AvatarVerse pipeline achieves zero-shot 3D modeling of 3D avatars that are not only more expressive, but also in higher quality and fidelity than previous works. Rigorous qualitative evaluations and user studies showcase AvatarVerse's superiority in synthesizing high-fidelity 3D avatars, leading to a new standard in high-quality and stable 3D avatar creation. Our project page is: https://avatarverse3d.github.io/
引用
收藏
页码:7124 / 7132
页数:9
相关论文
共 50 条
  • [31] 3D site effects:: A thorough analysis of a high-quality datasset
    Chávez-García, FJ
    Castillo, J
    Stephenson, WR
    BULLETIN OF THE SEISMOLOGICAL SOCIETY OF AMERICA, 2002, 92 (05) : 1941 - 1951
  • [32] Radiance-field holography for high-quality 3D reconstruction
    Liu, Taijiang
    Ning, Honglong
    Cao, Hongkun
    Luo, Dongxiang
    Tu, Kefeng
    Liu, Xianzhe
    Zhu, Zhennan
    Chen, Haoyan
    Su, Guoping
    Yao, Rihui
    Peng, Junbiao
    OPTICS AND LASERS IN ENGINEERING, 2024, 178
  • [33] Range Sensor and Silhouette Fusion for High-Quality 3D Scanning
    Narayan, Karthik S.
    Sha, James
    Singh, Arjun
    Abbeel, Pieter
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 3617 - 3624
  • [34] Fusion-based high-quality polarization 3D reconstruction
    Liu, Rui
    Liang, Hao
    Wang, Zhongyuan
    Ma, Jiayi
    Tian, Xin
    OPTICS AND LASERS IN ENGINEERING, 2023, 162
  • [35] Towards High-Quality and Disentangled Face Editing in a 3D GAN
    Jiang, Kaiwen
    Chen, Shu-Yu
    Liu, Feng-Lin
    Fu, Hongbo
    Gao, Lin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2533 - 2544
  • [36] Robust Computer Vision Techniques for High-quality 3D Modeling
    Lee, Joon-Young
    Jung, Jiyoung
    Bok, Yunsu
    Park, Jaesik
    Choi, Dong-Geol
    Han, Yudeog
    Kweon, In So
    2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 6 - 10
  • [37] Creating High-quality 3D Assets for Realistic xR Solutions
    Giorgi, Daniela
    ERCIM NEWS, 2024, (137):
  • [38] MinD-3D: Reconstruct High-Quality 3D Objects in Human Brain
    Gao, Jianxiong
    Fu, Yuqian
    Wang, Yun
    Qian, Xuelin
    Feng, Jianfeng
    Fu, Yanwei
    COMPUTER VISION - ECCV 2024, PT XLVII, 2025, 15105 : 312 - 329
  • [39] Text-driven 3D Avatar Animation with Emotional and Expressive Behaviors
    Hu, Li
    Qi, Jinwei
    Zhang, Bang
    Pan, Pan
    Xu, Yinghui
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2816 - 2818
  • [40] 3D Avatar Approach for Continuous Sign Movement Using Speech/Text
    Das Chakladar, Debashis
    Kumar, Pradeep
    Mandal, Shubham
    Roy, Partha Pratim
    Iwamura, Masakazu
    Kim, Byung-Gyu
    APPLIED SCIENCES-BASEL, 2021, 11 (08):