Automatic Corpus-based Tone Prediction using K-ToBI Representation

Cited by: 0

Authors:

Lee, JS [1]

Kim, B [1]

Lee, GG [1]

Affiliation:

[1] Pohang Univ Sci & Technol, Dept Comp Sci & Engn, Pohang 790784, South Korea

Source:

PROCEEDINGS OF THE 2001 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING | 2001

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we present a prosody generation architecture based on K-ToBI (Korean Tone and Break Index) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. A TTS system that adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with direct prosody generation TTS systems. However, corpus preparation is very expensive for practical-level performance because the ToBI-labeled corpus has been manually constructed by many prosody experts, and statistical prosody modeling normally requires a large amount of data. Contrary to previous ToBI-based systems, this paper proposes a new method which transcribes the K-ToBI labels completely automatically in Korean speech. We developed automatic corpus-based K-ToBI labeling tools and prediction methods based on several lexico-syntactic linguistic features for decision-tree induction. We demonstrated the performance of F0 generation from automatically predicted K-ToBI labels, and confirmed that the performance is reasonably comparable with state-of-the-art direct prosody generation methods and previous ToBI-based methods.

Citation

Pages: 134-142

Page count: 9
