MOS and pair comparison combined methods for quality evaluation of text-to-speech systems

被引：0

作者：

Salza, PL

Foti, E

Nebbia, L

Oreglia, M

机构：

来源：

ACUSTICA | 1996年 / 82卷 / 04期

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The overall quality of three Text-To-Speech (TTS) synthesis systems for Italian with common prosodic control but different diphones and synthesizers was evaluated by means of the combined application of Mean Opinion Score and Pair Comparison methods. Direct comparison between the two methods serves to validate MOS, which is the the technique recommended by CCITT for synthesis evaluation. In the MOS experiment, assessment also included three types of natural speech (normal and degraded) as reference. Eighteen subjects expressed 2880 MOS judgements and made 720 comparisons in all. The results obtained from the two methods showed good agreement. The most important MOS voice parameters used by listeners for differentiating the systems were Global Impression, Voice, Articulation and Pronunciation. The diphones appeared to contribute most to the different judgements, whereas synthesizers were not perceived as different by listeners. This experiment provides positive verification of interlaboratory reproducibility of MOS, which proved to be an effective technique for overall assessment of TTS quality.

引用

页码：650 / 656

页数：7

共 50 条

[21] Evaluation of The Concatenative Turkish Text-to-Speech System
Orhan, Zeynep
Gormez, Zeliha
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4314 - +
[22] Automatic Syllabification for Danish Text-to-Speech Systems
Beck, Jeppe
Braga, Daniela
Nogueira, Joao
Dias, Miguel Sales
Coelho, Luis
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1291 - 1294
[23] A Comparative Study of Text-to-Speech Systems in LabVIEW
Panoiu, Manuela
Rat, Cezara-Liliana
Panoiu, Caius
SOFT COMPUTING APPLICATIONS, (SOFA 2014), VOL 1, 2016, 356 : 3 - 11
[24] Method of intelligibility testing for text-to-speech systems
Sheffield, E
Polizzi, P
PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A862 - A865
[25] On the Construction of Unit Databanks for Text-to-Speech Systems
Latsch, Vagner L.
Netto, Sergio L.
PROCEEDINGS OF THE IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 2006, : 340 - 343
[26] Spectral Smoothening Based Waveform Concatenation Technique for Speech Quality Enhancement in Text-to-Speech Systems
Panda, Soumya Priyadarsini
Nayak, Ajit Kumar
ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 425 - 432
[27] CONTROLLING PHONEME SYNTHESIZERS IN TEXT-TO-SPEECH SYSTEMS
RUHL, HW
DREISSIG, D
KULAS, W
NTZ ARCHIV, 1984, 6 (10): : 243 - 248
[28] Duration analysis for malayalam text-to-speech systems
Gopinath, Deepa P.
Divya, Sree J.
Mathew, Reshmi
Rekhila, S. J.
Nair, Achuthsankar S.
ICIT 2006: 9TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2006, : 129 - +
[29] INTONATION IN TEXT-TO-SPEECH SYNTHESIS - EVALUATION OF ALGORITHMS
AKERS, G
LENNIG, M
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 77 (06): : 2157 - 2165
[30] Text-to-speech for low-resource systems
Schnell, M
Küstner, M
Jokisch, O
Hoffmann, R
PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 259 - 262

← 1 2 3 4 5 →