Evaluation of a Korean Lip-Sync System for an Android Robot

被引:0
|
作者
Hyung, Hyun-Jun [1 ,2 ]
Ahn, Byeong-Kyu [2 ]
Choi, Dongwoon [2 ]
Lee, Dukyeon [2 ]
Lee, Dong-Wook [1 ,2 ]
机构
[1] Korea Univ Sci & Technol, Robot & Virtual Engn, Ansan 426910, South Korea
[2] Korea Inst Ind Technol, Robot R&D Grp, Ansan 426910, South Korea
关键词
Lip-sync; android robot; mouth shape; lip-sync timing; EveR;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lip-syncing of android robots resembling people is essential to accurately convey their intentions to humans. In this paper, we develop a system of Korean lip-syncing, with the assumption that people can guess a word or phrase from watching a lip-syncing robot without sound. The mouth shape for 10 single vowels was generated based on a Korean single vowels triangle chart. Robots can lip-sync in real time a variety of words and sentences using 10 mouth shapes. We performed experiments recording a mouth robot and an announcer reading text. We conducted a survey to assess humans guessing the representations of a female announcer and of a robot to compare the percent of correct answers in each case. Additionally, we also conducted a survey of robot mouth shapes and lip-sync timing to assess the reaction of subjects on 5-Likert scales. Results indicate that the percent of correct guesses from the mouth shape of the robot was one third of that from the human announcer. Subjects assessed the mouth shape and lip-sync timing of the robot as being somewhat unnatural. We expect that android robot lip-syncing currently uses mouth shapes that are perceived as lying in the uncanny valley when subjects try to interpret them. Thus, we will present a more natural mouth shape, add mouth shapes for diphthongs, and develop a mouth shape that varies with voice volume, improving the rate of lip-sync recognition.
引用
收藏
页码:78 / 82
页数:5
相关论文
共 50 条
  • [41] Web-based live speech-driven lip-sync An audio-driven rule-based approach
    Llorach, Gerard
    Evans, Alun
    Blat, Josep
    Grimm, Giso
    Hohmann, Volker
    2016 8TH INTERNATIONAL CONFERENCE ON GAMES AND VIRTUAL WORLDS FOR SERIOUS APPLICATIONS (VS-GAMES), 2016,
  • [42] NEW METHODS OF RECORDING 16MM MAGNETIC LIP-SYNC SOUND USING A MAGNETIC-OPTICAL SOUND PROJECTOR
    ASKREN, LT
    DWYER, RJ
    JOURNAL OF THE SMPTE-SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS, 1957, 66 (03): : 143 - 143
  • [43] High-definition multi-scale voice-driven facial animation: enhancing lip-sync clarity and image detail
    Zhang, Long
    Zhou, QingHua
    Tang, Shuai
    Chen, Yunxiang
    VISUAL COMPUTER, 2024,
  • [44] Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync
    Park, Junho
    Ko, Hanseok
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (07) : 1299 - 1306
  • [45] Identification and Evaluation of the Face System of a Child Android Robot Affetto for Surface Motion Design
    Ishihara, Hisashi
    Wu, Binyi
    Asada, Minoru
    FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [46] A Speech and Lip Authentication System Based On Android Smart Phone
    Zhang, Xinman
    Zhang, Jiayu
    He, Tingting
    Chen, Yiyu
    Shen, Yuanjun
    Xu, Xuebin
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: IOT AND SMART CITY (ICIT 2018), 2018, : 110 - 114
  • [47] Analysis and generation of laughter motions, and evaluation in an android robot
    Ishi, Carlos Toshinori
    Minato, Takashi
    Ishiguro, Hiroshi
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2019, 8
  • [48] Android System Security Evaluation
    Khokhlov, Igor
    Reznik, Leon
    2018 15TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2018,
  • [49] Educational System with the Android Robot SAYA and Field Trial
    Hashimoto, Takuya
    Kobayashi, Hiroshi
    Kato, Naoki
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 766 - 771
  • [50] The Performance of a Lip-Sync Imagery Model, New Combinations of Signals, a Supplemental Bond Graph Classifier, and Deep Formula Detection as an Extraction and Root Classifier for Electroencephalograms and Brain-Computer Interfaces
    Naebi, Ahmad
    Feng, Zuren
    APPLIED SCIENCES-BASEL, 2023, 13 (21):