AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

被引：0

作者：

Zhang, Huichao ^{[1
]}

Chen, Bowen ^{[1
]}

Yang, Hao ^{[1
]}

Qu, Liao ^{[1
,2
]}

Wang, Xu ^{[1
]}

Chen, Li ^{[1
]}

Long, Chao ^{[1
]}

Zhu, Feida ^{[1
]}

Du, Daniel ^{[1
]}

Zheng, Min ^{[1
]}

机构：

[1] ByteDance, Beijing, Peoples R China

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Creating expressive, diverse and high-quality 3D avatars from highly customized text descriptions and pose guidance is a challenging task, due to the intricacy of modeling and texturing in 3D that ensure details and various styles (realistic, fictional, etc). We present AvatarVerse, a stable pipeline for generating expressive high-quality 3D avatars from nothing but text descriptions and pose guidance. In specific, we introduce a 2D diffusion model conditioned on DensePose signal to establish 3D pose control of avatars through 2D images, which enhances view consistency from partially observed scenarios. It addresses the infamous Janus Problem and significantly stablizes the generation process. Moreover, we propose a progressive high-resolution 3D synthesis strategy, which obtains substantial improvement over the quality of the created 3D avatars. To this end, the proposed AvatarVerse pipeline achieves zero-shot 3D modeling of 3D avatars that are not only more expressive, but also in higher quality and fidelity than previous works. Rigorous qualitative evaluations and user studies showcase AvatarVerse's superiority in synthesizing high-fidelity 3D avatars, leading to a new standard in high-quality and stable 3D avatar creation. Our project page is: https://avatarverse3d.github.io/

引用

页码：7124 / 7132

页数：9

共 50 条

[21] High-Quality 3D Face Reconstruction from Multi-View Images
Cai L.
Guo Y.
Zhang J.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (02): : 305 - 314
[22] TEXT2AVATAR: TEXT TO 3D HUMAN AVATAR GENERATION WITH CODEBOOK-DRIVEN BODY CONTROLLABLE ATTRIBUTE
Gong, Chaoqun
Dai, Yuqin
Li, Ronghui
Bao, Achun
Li, Jun
Yang, Jian
Zhang, Yachao
Li, Xiu
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 16 - 20
[23] Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior
Wu, Yiqian
Xu, Hao
Tang, Xiangjun
Chen, Xien
Tang, Siyu
Zhang, Zhebin
Li, Chen
Jin, Xiaogang
ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (04):
[24] Rapid 3D Avatar Creation System Using a Single Depth Camera
Lim, Hwasup
Kang, Junseok
Ahn, Sang Chul
2019 26TH IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES (VR), 2019, : 1329 - 1330
[25] AgileAvatar: Stylized 3D Avatar Creation via Cascaded Domain Bridging
Sang, Shen
Zhi, Tiancheng
Song, Guoxian
Liu, Minghao
Lai, Chunpong
Liu, Jing
Wen, Xiang
Davis, James
Luo, Linjie
PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
[26] High-quality, customizable heuristics for RNA 3D structure alignment
Zurkowski, Michal
Antczak, Maciej
Szachniuk, Marta
BIOINFORMATICS, 2023, 39 (05)
[27] 3D Transformer-GAN for High-Quality PET Reconstruction
Luo, Yanmei
Wang, Yan
Zu, Chen
Zhan, Bo
Wu, Xi
Zhou, Jiliu
Shen, Dinggang
Zhou, Luping
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VI, 2021, 12906 : 276 - 285
[28] High-Quality 3D Face Reconstruction with Affine Convolutional Networks
Lin, Zhigian
Lin, Jiangke
Li, Lincheng
Yuan, Yi
Zou, Zhengxia
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2495 - 2503
[29] High-Quality Progressive Alignment of Large 3D Microscopy Data
Venkat, Aniketh
Hoang, Duong
Gyulassy, Attila
Bremer, Peer-Timo
Federer, Frederick
Angelucci, Alessandra
Pascucci, Valerio
2022 IEEE 12TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV 2022), 2022, : 63 - 72
[30] Polarized 3D: High-Quality Depth Sensing with Polarization Cues
Kadambi, Achuta
Taamazyan, Vage
Shi, Boxin
Raskar, Ramesh
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3370 - 3378

← 1 2 3 4 5 →