Acoustic speech unit segmentation for concatenative synthesis

Cited by: 4

Authors:

Torres, H. M. [1]

Gurlekian, J. A. [1]

Affiliations:

[1] Hosp Clin Buenos Aires, Inst Neurociencias Aplicadas, Consejo Nacl Invest Cient & Tecn, Lab Invest Sensoriales, RA-1120 Buenos Aires, DF, Argentina

Source:

COMPUTER SPEECH AND LANGUAGE | 2008 / Vol. 22 / Issue 02

Keywords:

Text to speech; Unit segmentation; Corpus-driven; Polyphones;

DOI:

10.1016/j.csl.2007.07.002

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Synthesis by concatenation of natural speech improves perceptual results when phonemes and syllables are segmented at places where spectral variations are small [Klatt, D., 1987. Review of text-to-speech conversion for English. J. Acoust. Soc. Am. 82 (3), 737-793]. An automatic segmentation method is explored here, using a tool based on a combination of Entropy Coding, Multiresolution Analysis, and Kohonen's Self-Organizing Maps. The segmentation method considers that there are no limits imposed by any linguistic unit. Resulting waveforms represent phone chains dominated by spectral dynamic structures. Each acoustic unit obtained could be composed of a variety of phonemes or a segmented part of them at the unit boundary. The number of units and unit structure are speaker dependent, i.e., rate, segmental and suprasegmental distinctive features affect them as dynamic structure varies. Results obtained from two databases - one male, one female - of 741 sentences each show this dependence, presenting a different number of units and occurrences for each speaker. Nevertheless, both speakers show a high occurrence of three-phoneme (36-24%) and four-phoneme (29-27%) sequences. Vowel-consonant-vowel sequences are the most frequent type (9.7-8.3%). Consonant-vowel syllables, which are phonemically frequent in Spanish (58%), are less represented (6.6-3.2%) using this method. The relevance of half-phone segmentation is verified given that 66% of the total units for the female speaker and 53% for the male speaker start and end with a segmented phone. Perceptual experiments showed that concatenated speech, created with dynamic acoustic units, was judged more natural than with diphone units. © 2007 Elsevier Ltd. All rights reserved.

Pages: 196-206

Number of pages: 11
