Visual-to-Speech Conversion Based on Maximum Likelihood Estimation

被引:0
|
作者
Ra, Rina [1 ]
Aihara, Ryo [1 ]
Takiguchi, Tesuya [1 ]
Ariki, Yasuo [1 ]
机构
[1] Kobe Univ, Grad Sch Syst Informat, Nada Ku, 1-1 Rokkodai, Kobe, Hyogo, Japan
关键词
VOICE CONVERSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a visual-to-speech conversion method that converts voiceless lip movements into voiced utterances without recognizing text information. Inspired by a Gaussian Mixture Model (GMM)-based voice conversion method, GMM is estimated from jointed visual and audio features and input visual features are converted to audio features using maximum likelihood estimation. In order to capture lip movements whose frame rate data is smaller than the audio data, we construct long-term image features. The proposed method has been evaluated using large-vocabulary continuous speech and experimental results show that our proposed method effectively estimates spectral envelopes and fundamental frequencies of audio speech from voiceless lip movements.
引用
收藏
页码:518 / 521
页数:4
相关论文
共 50 条
  • [21] Parameter Estimation for α-GMM Based on Maximum Likelihood Criterion
    Wu, Dalei
    NEURAL COMPUTATION, 2009, 21 (06) : 1776 - 1795
  • [22] PHASE ESTIMATION BASED ON THE MAXIMUM-LIKELIHOOD CRITERION
    ITOH, K
    OHTSUKA, Y
    APPLIED OPTICS, 1983, 22 (19) : 3054 - 3057
  • [23] Maximum likelihood based direction estimation for noncircular signals
    Choi, Yang-Ho
    SIGNAL PROCESSING, 2024, 220
  • [24] Reliability estimation based on survival analysis and maximum likelihood
    Dai Y.
    Liu Q.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (11): : 2976 - 2981
  • [25] An autofocus algorithm for ISAR based on the maximum likelihood estimation
    ATR Key lab, National Univ. of Defense Technology, Changsha 410073, China
    Guofang Keji Daxue Xuebao, 2006, 5 (63-67):
  • [26] Maximum Likelihood Estimation-Based SAR ADC
    Jayaraj, Akshay
    Chandrasekaran, Sanjeev Tannirkulam
    Ganesh, Archana
    Banerjee, Imon
    Sanyal, Arindam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (08) : 1311 - 1315
  • [27] A maximum likelihood based carrier frequency estimation algorithm
    Bian, DM
    Zhang, GX
    Yi, XY
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 185 - 188
  • [28] Robust adaptive beamforming based on maximum likelihood estimation
    Sun, Xinyu
    Lian, Xiaohua
    Zhou, Jianjiang
    2008 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY PROCEEDINGS, VOLS 1-4, 2008, : 1137 - 1140
  • [29] Maximum likelihood estimation based regression for multivariate calibration
    Guo, Lu
    Peng, Jiangtao
    Xie, Qiwei
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2018, 189 : 316 - 321
  • [30] FUSION OF IKONOS IMAGERY BASED ON MAXIMUM LIKELIHOOD ESTIMATION
    Shi, Aiye
    Tang, Min
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2011, 17 (07): : 945 - 956