Visual-to-Speech Conversion Based on Maximum Likelihood Estimation

被引:0
|
作者
Ra, Rina [1 ]
Aihara, Ryo [1 ]
Takiguchi, Tesuya [1 ]
Ariki, Yasuo [1 ]
机构
[1] Kobe Univ, Grad Sch Syst Informat, Nada Ku, 1-1 Rokkodai, Kobe, Hyogo, Japan
关键词
VOICE CONVERSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a visual-to-speech conversion method that converts voiceless lip movements into voiced utterances without recognizing text information. Inspired by a Gaussian Mixture Model (GMM)-based voice conversion method, GMM is estimated from jointed visual and audio features and input visual features are converted to audio features using maximum likelihood estimation. In order to capture lip movements whose frame rate data is smaller than the audio data, we construct long-term image features. The proposed method has been evaluated using large-vocabulary continuous speech and experimental results show that our proposed method effectively estimates spectral envelopes and fundamental frequencies of audio speech from voiceless lip movements.
引用
收藏
页码:518 / 521
页数:4
相关论文
共 50 条
  • [31] Intensity correction method based on maximum likelihood estimation
    Han, Yanxiang
    Zhang, Zhisheng
    ELECTRONICS LETTERS, 2012, 48 (14) : 829 - 830
  • [32] Camera Calibration Method Based on Maximum Likelihood Estimation
    Yoshioka, Michifumi
    Omatu, Sigeru
    DISTRIBUTED COMPUTING, ARTIFICIAL INTELLIGENCE, BIOINFORMATICS, SOFT COMPUTING, AND AMBIENT ASSISTED LIVING, PT II, PROCEEDINGS, 2009, 5518 : 616 - 620
  • [33] Genetic algorithm based maximum likelihood DOA estimation
    Li, M
    Lu, Y
    RADAR 2002, 2002, (490): : 502 - 506
  • [34] Radial velocity estimation of moving target based on maximum likelihood estimation
    Gao, Yesheng
    Guo, Xiaojiang
    Liu, Xingzhao
    ELECTRONICS LETTERS, 2018, 54 (16) : 1002 - 1003
  • [35] DOA estimation algorithm based on maximum likelihood estimation for nested array
    Electronic Engineering Institute, Hefei
    230037, China
    不详
    230037, China
    Hangkong Xuebao, 11
  • [36] Improved maximum likelihood frequency offset estimation based on likelihood metric design
    Minn, H
    Tarasak, P
    ICC 2005: IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-5, 2005, : 2150 - 2156
  • [37] Improved maximum likelihood frequency offset estimation based on likelihood metric design
    Minn, Hlaing
    Tarasak, Poramate
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (06) : 2076 - 2086
  • [38] Tutorial on maximum likelihood estimation
    Myung, IJ
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2003, 47 (01) : 90 - 100
  • [39] A note on maximum likelihood estimation
    Hengartner, NW
    AMERICAN STATISTICIAN, 1999, 53 (02): : 123 - 125
  • [40] Robust Maximum Likelihood Estimation
    Bertsimas, Dimitris
    Nohadani, Omid
    INFORMS JOURNAL ON COMPUTING, 2019, 31 (03) : 445 - 458