DialogueNeRF: towards realistic avatar face-to-face conversation video generation

Authors
Yichao Yan [1]
Zanwei Zhou [1]
Zi Wang [1]
Jingnan Gao [1]
Xiaokang Yang [1]
Affiliations
[1] Shanghai Jiao Tong University, MoE Key Lab of Artificial Intelligence, AI Institute
Source
Visual Intelligence, Vol. 2, No. 1
Keywords
Talking face generation; Neural radiance field; Face reenactment; Conversation generation
DOI
10.1007/s44267-024-00057-8
Abstract
Conversation is an essential component of virtual avatar activities in the metaverse. With the development of natural language processing, significant breakthroughs have been made in text and voice conversation generation. Face-to-face conversations account for the vast majority of daily conversations, yet most existing methods focus on single-person talking head generation. In this work, we take a step further and consider generating realistic face-to-face conversation videos. Conversation generation is more challenging than single-person talking head generation because it requires not only photo-realistic individual talking heads but also the listener's response to the speaker. To address these challenges, we propose a novel unified framework based on the neural radiance field (NeRF). Specifically, we model both the speaker and the listener with a NeRF framework under different conditions to control individual expressions. The speaker is driven by the audio signal, while the response of the listener depends on both visual and acoustic information. In this way, face-to-face conversation videos are generated between human avatars, with all the interlocutors modeled within the same network. Moreover, to facilitate future research on this task, we also collected a new human conversation dataset containing 34 video clips. Quantitative and qualitative experiments evaluate our method in different aspects, e.g., image quality, pose sequence trend, and natural rendering of the scene in the generated videos. Experimental results demonstrate that the avatars in the resulting videos are able to carry on a realistic conversation and maintain individual styles.