LSM-based unit pruning for concatenative speech synthesis

Author
Bellegarda, Jerome R. [1]
Affiliation
[1] Apple Comp Inc, Speech & Language Technol, Cupertino, CA 95014 USA
Keywords
text-to-speech synthesis; unit selection; inventory pruning; outlier removal; unit redundancy management;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
The level of quality that can be achieved in concatenative text-to-speech synthesis is primarily governed by the inventory of units used in unit selection. This has led to the collection of ever larger corpora in the quest for ever more natural synthetic speech. As operational considerations limit the size of the unit inventory, however, pruning is critical to removing any instances that prove either spurious or superfluous. This paper proposes a novel pruning strategy based on a data-driven feature extraction framework separately optimized for each unit type in the inventory. A single distinctiveness/redundancy measure can then address, in a consistent manner, the (traditionally separate) problems of outliers and redundant units. Experimental results underscore the viability of this approach for both moderate and aggressive inventory pruning.
Pages: 521-524
Page count: 4