AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

Authors
Zhang, Huichao [1 ]
Chen, Bowen [1 ]
Yang, Hao [1 ]
Qu, Liao [1 ,2 ]
Wang, Xu [1 ]
Chen, Li [1 ]
Long, Chao [1 ]
Zhu, Feida [1 ]
Du, Daniel [1 ]
Zheng, Min [1 ]
Affiliations
[1] ByteDance, Beijing, Peoples R China
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Creating expressive, diverse, and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of 3D modeling and texturing needed to preserve details across various styles (realistic, fictional, etc.). We present AvatarVerse, a stable pipeline for generating expressive, high-quality 3D avatars from nothing but text descriptions and pose guidance. Specifically, we introduce a 2D diffusion model conditioned on the DensePose signal to establish 3D pose control of avatars through 2D images, which enhances view consistency in partially observed scenarios. It addresses the infamous Janus problem and significantly stabilizes the generation process. Moreover, we propose a progressive high-resolution 3D synthesis strategy, which substantially improves the quality of the created 3D avatars. As a result, the proposed AvatarVerse pipeline achieves zero-shot 3D modeling of avatars that are not only more expressive but also of higher quality and fidelity than those of previous works. Rigorous qualitative evaluations and user studies showcase AvatarVerse's superiority in synthesizing high-fidelity 3D avatars, setting a new standard for high-quality and stable 3D avatar creation. Our project page is: https://avatarverse3d.github.io/
Pages: 7124-7132
Page count: 9