Automatic Corpus-based Tone using K-TOBI Representation

被引:0
|
作者
Lee, JS [1 ]
Kim, B [1 ]
Lee, GG [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci & Engn, Pohang 790784, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a prosody generation axchitecture based on K-ToBI (Korean Tone and Break Index) representation. ToBI is, a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. The TTS system which adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with the direct prosody generation TTS systems. However, the cost of corpus preparation is very expensive for practical-level performance because the TbBI labeled corpus has been manually constructed by many prosody experts and normally requires large amount of data for statistical prosody modeling. Contrary to previous ToBI-based systems, this paper proposes a new method which transcribes the K-ToBI labels completely automatically in Korean speech. We developed automatic corpus-based K-TOBI labeling tools and prediction methods based on several lexico-syntactic linguistic features for decision-tree induction. We demonstrated the performance of F0 generation from automatically predicted K-ToBI labels, and confirmed that the performance is reasonably comparable with state-of-the-art direct prosody generation methods and previous TOBI-based methods.
引用
收藏
页码:134 / 142
页数:9
相关论文
共 50 条
  • [1] Corpus-based induction of lexical representation and meaning
    Lapata, Maria
    Proceedings of the National Conference on Artificial Intelligence, 1999,
  • [2] Purely Corpus-based Automatic Conversation Authoring
    Duplessis, Guillaume Dubuisson
    Letard, Vincent
    Ligozat, Anne-Laure
    Rosset, Sophie
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2728 - 2735
  • [3] Automatic SNOMED classification - a corpus-based method
    deBruijn, LM
    Hasman, A
    Arends, JW
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 1997, 54 (1-2) : 115 - 122
  • [4] Translating the representation of the tourist landscape: A corpus-based study
    Gandin, Stefania
    TOPICS IN LINGUISTICS, 2015, 15 (01)
  • [5] ATTITUDE and Identity Categorizations: A Corpus-based Study of Gender Representation
    Bakar, Kesumawati A.
    INTERNATIONAL CONFERENCE ON EDUCATION & EDUCATIONAL PSYCHOLOGY 2013 (ICEEPSY 2013), 2014, 112 : 747 - 756
  • [6] A Corpus-Based Approach for Automatic Thai Unknown Word Recognition Using Boosting Techniques
    Techo, Jakkrit
    Nattee, Cholwich
    Theeramunkong, Thanaruk
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (12): : 2321 - 2333
  • [7] Learn to blend in!: A corpus-based analysis of the representation of women in mining
    Norberg, Cathrine
    Faltholm, Ylva
    EQUALITY DIVERSITY AND INCLUSION, 2018, 37 (07): : 698 - 712
  • [8] Portuguese corpus-based learning using ETL
    Milidiú, Ruy Luiz
    dos Santos, Cícero Nogueira
    Duarte, Julio Cesar
    Journal of the Brazilian Computer Society, 2008, 14 (04) : 17 - 27
  • [9] A Corpus-Based Approach for Automatic Thai Unknown Word Recognition using Ensemble Learning Techniques
    TeCho, Jakkrit
    Nattee, Cholwich
    Theeramunkong, Thanaruk
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 533 - 540
  • [10] The Automatic Identification of Conceptual Metaphors in Hungarian Texts: A Corpus-Based Analysis
    Babarczy, Anna
    Bencze, Ildiko M.
    Feketel, Istvan
    Simon, Eszter
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : E31 - E36