LSM-based unit pruning for concatenative speech synthesis

被引:0
|
作者
Bellegarda, Jerome R. [1 ]
机构
[1] Apple Comp Inc, Speech & Language Technol, Cupertino, CA 95014 USA
关键词
text-to-speech synthesis; unit selection; inventory pruning; outlier removal; unit redundancy management;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The level of quality that can be achieved in concatenative text-to-speech synthesis is primarily governed by the inventory of units used in unit selection. This has led to the collection of ever larger corpora in the quest for ever more natural synthetic speech. As operational considerations limit the size of the unit inventory, however, pruning is critical to removing any instances that prove either spurious or superfluous. This paper proposes a novel pruning strategy based on a data-driven feature extraction framework separately optimized for each unit type in the inventory. A single distinctiveness/redundancy measure can then address, in a consistent manner, the (traditionally separate) problems of outliers and redundant units. Experimental results underscore the viability of this approach for both moderate and aggressive inventory pruning.
引用
收藏
页码:521 / 524
页数:4
相关论文
共 50 条
  • [31] Diphone-based concatenative speech synthesis system for Mongolian
    Davaatsagaan, Munkhtuya
    Paliwal, Kuldip K.
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 276 - 279
  • [32] Spectral modification for concatenative speech synthesis
    Wouters, J
    Macon, MW
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 941 - 944
  • [33] Dissecting, Designing, and Optimizing LSM-based Data Stores
    Sarkar, Subhadeep
    Athanassoulis, Manos
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2489 - 2497
  • [34] Allophone-based concatenative speech synthesis system for Russian
    Skrelin, PA
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 156 - 159
  • [35] Discriminative training for concatenative speech synthesis
    Kim, NS
    Park, SS
    IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (01) : 40 - 43
  • [36] LSMPMON: Performance Evaluation Mechanism of LSM-based Secure OS
    Yamauchi, Toshihiro
    Yamamoto, Kenji
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2012, 6 (02): : 81 - 90
  • [37] Specific Acoustic Unit Processing in Concatenative Romanian Speech Synthesis Used for Talking Agents
    Negrescu, Cristian
    Ciobanu, Amelia
    Ilie, Mihai Daniel
    2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN - COMPUTER DIALOGUE (SPED), 2013,
  • [38] Isogeometric analysis for parameterized LSM-based structural topology optimization
    Wang, Yingjun
    Benson, David J.
    COMPUTATIONAL MECHANICS, 2016, 57 (01) : 19 - 35
  • [39] DualDB: An Efficient LSM-based Publish/Subscribe Storage System
    Qader, Mohiuddin Abdul
    Hristidis, Vagelis
    SSDBM 2017: 29TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2017,
  • [40] Breaking Down Memory Walls in LSM-based Storage Systems
    Luo, Chen
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2817 - 2819