Automatic Corpus-based Tone using K-TOBI Representation

Cited by: 0
Authors
Lee, JS [1]
Kim, B [1]
Lee, GG [1]
Affiliations
[1] Pohang Univ Sci & Technol, Dept Comp Sci & Engn, Pohang 790784, South Korea
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present a prosody generation architecture based on K-ToBI (Korean Tone and Break Index) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. A TTS system that adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with direct prosody generation TTS systems. However, corpus preparation is very expensive for practical-level performance because the ToBI-labeled corpus has been manually constructed by many prosody experts, and statistical prosody modeling normally requires a large amount of data. Contrary to previous ToBI-based systems, this paper proposes a new method which transcribes the K-ToBI labels completely automatically in Korean speech. We developed automatic corpus-based K-ToBI labeling tools and prediction methods based on several lexico-syntactic linguistic features for decision-tree induction. We demonstrated the performance of F0 generation from automatically predicted K-ToBI labels, and confirmed that the performance is reasonably comparable with state-of-the-art direct prosody generation methods and previous ToBI-based methods.
Pages: 134-142
Page count: 9