LSM-based unit pruning for concatenative speech synthesis

被引：0

作者：

Bellegarda, Jerome R. ^{[1
]}

机构：

[1] Apple Comp Inc, Speech & Language Technol, Cupertino, CA 95014 USA

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年

关键词：

text-to-speech synthesis; unit selection; inventory pruning; outlier removal; unit redundancy management;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The level of quality that can be achieved in concatenative text-to-speech synthesis is primarily governed by the inventory of units used in unit selection. This has led to the collection of ever larger corpora in the quest for ever more natural synthetic speech. As operational considerations limit the size of the unit inventory, however, pruning is critical to removing any instances that prove either spurious or superfluous. This paper proposes a novel pruning strategy based on a data-driven feature extraction framework separately optimized for each unit type in the inventory. A single distinctiveness/redundancy measure can then address, in a consistent manner, the (traditionally separate) problems of outliers and redundant units. Experimental results underscore the viability of this approach for both moderate and aggressive inventory pruning.

引用

页码：521 / 524

页数：4

共 50 条

[41] LSM-based Secure System Monitoring Using Kernel Protection Schemes
Isohara, Takamasa
Takemori, Keisuke
Miyake, Yutaka
Qu, Ning
Perrig, Adrian
FIFTH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY: ARES 2010, PROCEEDINGS, 2010, : 591 - 596
[42] Hailstorm: Disaggregated Compute and Storage for Distributed LSM-based Databases
Bindschaedler, Laurent
Goel, Ashvin
Zwaenepoel, Willy
TWENTY-FIFTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXV), 2020, : 301 - 316
[43] An efficient unit-selection method for concatenative Text-to-speech synthesis systems
Gros, Jerneja Zganec
Zganec, Mario
Journal of Computing and Information Technology, 2008, 16 (01) : 69 - 78
[44] An auditory-based distortion measure with application to concatenative speech synthesis
Hansen, JHL
Chappell, DT
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 489 - 495
[45] Forward masking phenomenon in concatenative speech synthesis
Cernak, M
Rozinaj, G
PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 691 - 694
[46] A Flexible Architecture for Urdu Phonemes-Based Concatenative Speech Synthesis
Ahmad, Muhammad Rizwan
Arshad, Muhammad Junaid
MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2016, 35 (03) : 373 - 380
[47] High-Individuality Voice Conversion Based on Concatenative Speech Synthesis
Fujii, Kei
Okawa, Jun
Suigetsu, Kaori
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 483 - 488
[48] Auditory-based distortion measure with application to concatenative speech synthesis
Duke Univ, Durham, United States
IEEE Trans Speech Audio Process, 5 (489-495):
[49] Automatic Labeling Schemes for Concatenative Speech Synthesis
Kacur, Juraj
Cepko, Jozef
Palenik, Andrej
PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642
[50] Perseid: A Secondary Indexing Mechanism for LSM-Based Storage Systems
Wang, Jing
Lu, Youyou
Wang, Qing
Zhang, Yuhao
Shu, Jiwu
ACM TRANSACTIONS ON STORAGE, 2024, 20 (02)

← 1 2 3 4 5 →