Out of Time: Automated Lip Sync in the Wild

被引:217
|
作者
Chung, Joon Son [1 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Visual Geometry Grp, Dept Engn Sci, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
SPEECH; SYNCHRONIZATION; TRANSLATION;
D O I
10.1007/978-3-319-54427-4_19
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The goal of this work is to determine the audio-video synchronisation between mouth motion and speech in a video. We propose a two-stream ConvNet architecture that enables the mapping between the sound and the mouth images to be trained end-to-end from unlabelled data. The trained network is used to determine the lip-sync error in a video. We apply the network to two further tasks: active speaker detection and lip reading. On both tasks we set a new state-of-the-art on standard benchmark datasets.
引用
收藏
页码:251 / 263
页数:13
相关论文
共 50 条
  • [21] Out-of-sync Schedule Robustness for Time-sensitive Networks
    Craciunas, Silviu S.
    Oliver, Ramon Serna
    17TH IEEE INTERNATIONAL WORKSHOP ON FACTORY COMMUNICATION SYSTEMS 2021 (WFCS 2021), 2021, : 75 - 82
  • [22] OUT OF SYNC MEMORIES - A RECOLLECTION
    BOGDANOVIC, B
    TEMPS MODERNES, 1994, 49 (576-78): : 58 - 62
  • [23] An academic life out of sync
    Ostrow, Ellen
    Chronicle of Higher Education, 2003, 49 (48 SEC. 3)
  • [24] iTunes hacker is out of sync
    不详
    NEW SCIENTIST, 2003, 180 (2424) : 6 - 6
  • [25] Lip-sync Personal Authentication System Using Movement Feature of Lip
    Nakata, Tatsuya
    Kashima, Masayuki
    Sato, Kiminori
    Watanabe, Mutsumi
    2013 INTERNATIONAL CONFERENCE ON BIOMETRICS AND KANSEI ENGINEERING (ICBAKE), 2013, : 273 - 276
  • [26] Speaker Dependent Real-Time Vowel Recognition Algorithm for Lip Sync in Digital Contents
    Hwang, Sun-Min
    Song, Bok-Hee
    Yun, Han-Hyung
    2013 INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2013,
  • [27] FRAMEWORK DEVELOPMENT OF REAL-TIME LIP SYNC ANIMATION ON VISEME BASED HUMAN SPEECH
    Hoon, Loh Ngiik
    Rahman, Khairul Aidil Azlin Abd.
    Chai, Wang Yin
    JURNAL TEKNOLOGI, 2015, 75 (04): : 43 - 48
  • [28] Seeing the Sound: Multilingual Lip Sync for Real-Time Face-to-Face Translation
    Oskooei, Amirkia Rafiei
    Aktas, Mehmet S.
    Keles, Mustafa
    COMPUTERS, 2025, 14 (01)
  • [29] Lip Sync Matters: A Novel Multimodal Forgery Detector
    Shahzad, Sahibzada Adil
    Hashmi, Ammarah
    Khan, Sarwar
    Peng, Yan-Tsung
    Tsao, Yu
    Wang, Hsin-Min
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1885 - 1892
  • [30] Synthesizing Obama: Learning Lip Sync from Audio
    Suwajanakorn, Supasorn
    Seitz, Steven M.
    Kemelmacher-Shlizerman, Ira
    ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (04):