Using HMM-based Speech Synthesis to Reconstruct the Voice of Individuals with Degenerative Speech Disorders

被引：0

作者：

Veaux, Christophe ^{[1
]}

Yamagishi, Junichi ^{[1
]}

King, Simon ^{[1
]}

机构：

[1] Univ Edinburgh, CSTR, Edinburgh EH8 9YL, Midlothian, Scotland

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

HTS; Voice Cloning; Voice Reconstruction; Assistive Technologies;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

When individuals lose the ability to produce their own speech, due to degenerative diseases such as motor neuron disease (MND) or Parkinson's, they lose not only a functional means of communication but also a display of their individual and group identity. In order to build personalized synthetic voices, attempts have been made to capture the voice before it is lost, using a process known as voice banking. But, for some patients, the speech deterioration frequently coincides or quickly follows diagnosis. Using HMM-based speech synthesis, it is now possible to build personalized synthetic voices with minimal data recordings and even disordered speech. In this approach, the patient's recordings are used to adapt an average voice model pre-trained on many speakers. The structure of the voice model allows some reconstruction of the voice by substituting some components from the average voice in order to compensate for the disorders found in the patient's speech. In this paper, we compare different substitution strategies and introduce a context-dependent model substitution to improve the intelligibility of the synthetic speech while retaining the vocal identity of the patient. A subjective evaluation of the reconstructed voice for a patient with MND shows promising results for this strategy.

引用

页码：966 / 969

页数：4

共 50 条

[21] Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis
Yamagishi, Junichi
Watts, Oliver
King, Simon
Usabaev, Bela
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 418 - +
[22] Analysis of HMM-Based Lombard Speech Synthesis
Raitio, Tuomo
Suni, Antti
Vainio, Martti
Alku, Paavo
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2792 - +
[23] A speech parameter generation algorithm using local variance for HMM-based speech synthesis
Chunwijitra, Vataya
Nose, Takashi
Kobayashi, Takao
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1150 - 1153
[24] Synthesis of stressed speech from isolated neutral speech using HMM-based models
BouGhazale, SE
Hansen, JHL
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1860 - 1863
[25] Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic
Houidhek, Amal
Colotte, Vincent
Mnasri, Zied
Jouvet, Denis
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (04) : 895 - 906
[26] SPEECH-LAUGHS: AN HMM-BASED APPROACH FOR AMUSED SPEECH SYNTHESIS
El Haddad, Kevin
Dupont, Stephane
Urbain, Jerome
Dutoit, Thierry
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4939 - 4943
[27] Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis
Tachibana, Makoto
Izawa, Shinsuke
Nose, Takashi
Kobayashi, Takao
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4633 - 4636
[28] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
Gia-Nhu Nguyen
Trung-Nghia Phung
EURASIP Journal on Audio, Speech, and Music Processing, 2017
[29] Reducing over-smoothness in HMM-based speech synthesis using exemplar-based voice conversion
Gia-Nhu Nguyen
Trung-Nghia Phung
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2017,
[30] HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling
Maeno, Yu
Nose, Takashi
Kobayashi, Takao
Ijima, Yusuke
Nakajima, Hideharu
Mizuno, Hideyuki
Yoshioka, Osamu
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1860 - +

← 1 2 3 4 5 →