AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

Citations: 0

Authors:

Zhang, Huichao [1]

Chen, Bowen [1]

Yang, Hao [1]

Qu, Liao [1,2]

Wang, Xu [1]

Chen, Li [1]

Long, Chao [1]

Zhu, Feida [1]

Du, Daniel [1]

Zheng, Min [1]

Affiliations:

[1] ByteDance, Beijing, People's Republic of China

[2] Carnegie Mellon University, Pittsburgh, PA 15213, USA

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Creating expressive, diverse, and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of the 3D modeling and texturing needed to ensure details and support various styles (realistic, fictional, etc.). We present AvatarVerse, a stable pipeline for generating expressive, high-quality 3D avatars from nothing but text descriptions and pose guidance. Specifically, we introduce a 2D diffusion model conditioned on the DensePose signal to establish 3D pose control of avatars through 2D images, which enhances view consistency from partially observed scenarios. It addresses the infamous Janus Problem and significantly stabilizes the generation process. Moreover, we propose a progressive high-resolution 3D synthesis strategy, which substantially improves the quality of the created 3D avatars. As a result, the proposed AvatarVerse pipeline achieves zero-shot modeling of 3D avatars that are not only more expressive, but also of higher quality and fidelity than previous works. Rigorous qualitative evaluations and user studies showcase AvatarVerse's superiority in synthesizing high-fidelity 3D avatars, leading to a new standard in high-quality and stable 3D avatar creation. Our project page is: https://avatarverse3d.github.io/


Pages: 7124-7132

Page count: 9

