Evaluation of a Korean Lip-Sync System for an Android Robot

被引:0
|
作者
Hyung, Hyun-Jun [1 ,2 ]
Ahn, Byeong-Kyu [2 ]
Choi, Dongwoon [2 ]
Lee, Dukyeon [2 ]
Lee, Dong-Wook [1 ,2 ]
机构
[1] Korea Univ Sci & Technol, Robot & Virtual Engn, Ansan 426910, South Korea
[2] Korea Inst Ind Technol, Robot R&D Grp, Ansan 426910, South Korea
关键词
Lip-sync; android robot; mouth shape; lip-sync timing; EveR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lip-syncing of android robots resembling people is essential to accurately convey their intentions to humans. In this paper, we develop a system of Korean lip-syncing, with the assumption that people can guess a word or phrase from watching a lip-syncing robot without sound. The mouth shape for 10 single vowels was generated based on a Korean single vowels triangle chart. Robots can lip-sync in real time a variety of words and sentences using 10 mouth shapes. We performed experiments recording a mouth robot and an announcer reading text. We conducted a survey to assess humans guessing the representations of a female announcer and of a robot to compare the percent of correct answers in each case. Additionally, we also conducted a survey of robot mouth shapes and lip-sync timing to assess the reaction of subjects on 5-Likert scales. Results indicate that the percent of correct guesses from the mouth shape of the robot was one third of that from the human announcer. Subjects assessed the mouth shape and lip-sync timing of the robot as being somewhat unnatural. We expect that android robot lip-syncing currently uses mouth shapes that are perceived as lying in the uncanny valley when subjects try to interpret them. Thus, we will present a more natural mouth shape, add mouth shapes for diphthongs, and develop a mouth shape that varies with voice volume, improving the rate of lip-sync recognition.
引用
收藏
页码:78 / 82
页数:5
相关论文
共 50 条
  • [21] EXPLORING PHONETIC CONTEXT-AWARE LIP-SYNC FOR TALKING FACE GENERATION
    Park, Se Jin
    Kim, Minsu
    Choi, Jeongsoo
    Ro, Yong Man
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4325 - 4329
  • [22] Evaluating Effects of Listening to Content with Lip-sync Animation on Head Mounted Displays
    Isoyama, Naoya
    Terada, Tsutomu
    Tsukamoto, Masahiko
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2017 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC '17 ADJUNCT), 2017, : 666 - 672
  • [23] Lip-sync in human face animation based on video analysis and spline models
    Tang, SS
    Liew, AWC
    Yan, H
    10TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, 2004, : 102 - 108
  • [24] Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers
    Sun, Yasheng
    Zhou, Hang
    Wang, Kaisiyuan
    Wu, Qianyi
    Hong, Zhibin
    Liu, Jingtuo
    Ding, Errui
    Wang, Jingdong
    Liu, Ziwei
    Koike, Hideki
    PROCEEDINGS SIGGRAPH ASIA 2022, 2022,
  • [25] MILG: Realistic lip-sync video generation with audio-modulated image inpainting
    Bao, Han
    Zhang, Xuhong
    Wang, Qinying
    Liang, Kangming
    Wang, Zonghui
    Ji, Shouling
    Chen, Wenzhi
    VISUAL INFORMATICS, 2024, 8 (03): : 71 - 81
  • [26] Lost in Translation: Lip-Sync Deepfake Detection from Audio-Video Mismatch
    Bohacek, Matyas
    Farid, Hany
    IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2024, : 4315 - 4323
  • [27] Visual dubbing pipeline with localized lip-sync and two-pass identity transfer
    Patel, Dhyey
    Zouaghi, Houssem
    Mudur, Sudhir
    Paquette, Eric
    Laforest, Serge
    Rouillard, Martin
    Popa, Tiberiu
    COMPUTERS & GRAPHICS-UK, 2023, 110 : 19 - 27
  • [28] LIP-SYNC EDITING AND MIXING OF NON-PERFORATED MAGNETIC-TAPE USING NEW SYNCHROLOCK TAPE SYSTEM
    BUNTING, G
    SMPTE JOURNAL, 1977, 86 (07): : 482 - 486
  • [29] Multimodal translation system using texture-mapped lip-sync images for video mail and automatic dubbing applications
    Morishima, S. (shigeo@waseda.jp), 1637, Hindawi Publishing Corporation (2004):
  • [30] Lip-sync in 'Lipstick':: 1950s popular songs in a television series by Dennis!Potter
    Walden, Joshua
    JOURNAL OF MUSICOLOGICAL RESEARCH, 2008, 27 (02) : 169 - 195