Evaluation of a Korean Lip-Sync System for an Android Robot

Cited by: 0

Authors:

Hyung, Hyun-Jun ^{[1,2]}

Ahn, Byeong-Kyu ^{[2]}

Choi, Dongwoon ^{[2]}

Lee, Dukyeon ^{[2]}

Lee, Dong-Wook ^{[1,2]}

Affiliations:

[1] Korea Univ Sci & Technol, Robot & Virtual Engn, Ansan 426910, South Korea

[2] Korea Inst Ind Technol, Robot R&D Grp, Ansan 426910, South Korea

Source:

2016 13th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI) | 2016

Keywords:

Lip-sync; android robot; mouth shape; lip-sync timing; EveR

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Lip-syncing by android robots that resemble people is essential for conveying their intentions to humans accurately. In this paper, we develop a Korean lip-sync system on the assumption that people can guess a word or phrase by watching a lip-syncing robot without sound. Mouth shapes for 10 single vowels were generated from a Korean single-vowel triangle chart. Using these 10 mouth shapes, the robot can lip-sync a variety of words and sentences in real time. In our experiments, we recorded a robot mouth and an announcer reading text. We conducted a survey in which subjects guessed what a female announcer and the robot were saying, and compared the percentage of correct answers in each case. We also surveyed subjects' reactions to the robot's mouth shapes and lip-sync timing on 5-point Likert scales. Results indicate that the percentage of correct guesses from the mouth shape of the robot was one third of that from the human announcer. Subjects rated the mouth shape and lip-sync timing of the robot as somewhat unnatural. We expect that the mouth shapes currently used for android robot lip-syncing are perceived as lying in the uncanny valley when subjects try to interpret them. We will therefore present a more natural mouth shape, add mouth shapes for diphthongs, and develop a mouth shape that varies with voice volume, in order to improve the rate of lip-sync recognition.
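The mapping from Korean text to a sequence of vowel mouth shapes that the abstract describes can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names are hypothetical, and the set of ten single vowels is assumed to be the ten monophthongs of standard Korean, which the record itself does not list. The sketch relies on the standard Unicode layout of precomposed Hangul syllables to extract each syllable's medial vowel, and it returns `None` for diphthongs, which the abstract names as future work.

```python
# Illustrative sketch only; not the authors' implementation.
HANGUL_FIRST = 0xAC00  # first precomposed Hangul syllable
HANGUL_LAST = 0xD7A3   # last precomposed Hangul syllable
FINALS_PER_MEDIAL = 28  # 27 final consonants plus "no final"
SYLLABLES_PER_INITIAL = 21 * FINALS_PER_MEDIAL  # 588 per initial consonant

# The 21 medial vowels in Unicode order.
MEDIAL_VOWELS = "ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ"

# ASSUMPTION: the ten single vowels are the standard Korean monophthongs.
SINGLE_VOWELS = frozenset("ㅏㅐㅓㅔㅗㅚㅜㅟㅡㅣ")


def medial_vowel(syllable):
    """Return the medial vowel of one Hangul syllable, or None otherwise."""
    code = ord(syllable)
    if not HANGUL_FIRST <= code <= HANGUL_LAST:
        return None
    offset = code - HANGUL_FIRST
    index = (offset % SYLLABLES_PER_INITIAL) // FINALS_PER_MEDIAL
    return MEDIAL_VOWELS[index]


def mouth_shape_sequence(text):
    """Map each Hangul syllable to a single-vowel mouth shape label.

    Non-Hangul characters are skipped. Syllables whose vowel is a
    diphthong yield None, since this sketch has no shape for them.
    """
    shapes = []
    for ch in text:
        vowel = medial_vowel(ch)
        if vowel is None:
            continue  # spaces, punctuation, non-Hangul text
        shapes.append(vowel if vowel in SINGLE_VOWELS else None)
    return shapes
```

Under these assumptions, `mouth_shape_sequence("한국어")` returns `['ㅏ', 'ㅜ', 'ㅓ']`. A real system would also need timing information for each shape, which this sketch does not attempt to model.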

Pages: 78-82

Page count: 5

