Developing a Thai emotional speech corpus from Lakorn (EMOLA)

被引：0

作者：

Sawit Kasuriya

Thanaruk Theeramunkong

Chai Wutiwiwatchai

Piyawat Sukhummek

机构：

[1] Thammasat University,School of Information, Computer and Communication Technologies, Sirindhorn International Institute of Technology

[2] Thammasat University,School of Information, Computer and Communication Technologies, Sirindhorn International Institute of Technology

[3] National Electronics and Computer Technology Center (NECTEC),Academy of Science

[4] Royal Society of Thailand,undefined

来源：

Language Resources and Evaluation | 2019年 / 53卷

关键词：

Simulated emotional speech corpus; Thai emotional speech corpus; Pleasure-Arousal-Dominance emotional state model; Collection-level; Annotator-oriented; Actor-oriented statistics;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Advances in emotional speech recognition and synthesis essentially rely on the availability of annotated emotional speech corpora. As a low resource language, the Thai language critically lacks corpora of emotional speech, although a few corpora have been constructed for speech recognition and synthesis. This paper presents the design of a Thai emotional speech corpus (namely EMOLA), its construction and annotation process, and its analysis. In the corpus design, four basic types with twelve subtypes of emotions are defined with consideration of the Pleasure-Arousal-Dominance emotional state model. To construct the corpus, a series of Thai dramas (1397 min) were selected and its video clips of approximately 868 min were annotated. As a result, 8987 transcriptions (of conversation turns) were derived in total, with each transcription tagged as one basic type and a few subtypes. Finally, an analysis was conducted to describe the characteristics of this corpus in three sets of statistics: collection-level, annotator-oriented and actor-oriented statistics.

引用

页码：17 / 55

页数：38

共 50 条

[31] JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
Xin, Detai
Jiang, Junfeng
Takamichi, Shinnosuke
Saito, Yuki
Aizawa, Akiko
Saruwatari, Hiroshi
arXiv, 2023,
[32] Emotional Speech Recognition Using Rhythm Metrics and a New Arabic Corpus
Meftah, Ali H.
Qamhan, Mustafa
Alotaibi, Yousef
Selouani, Sid-Ahmed
2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), 2020, : 57 - 62
[33] Signal energy-based Automatic Speech Splitter: A tool for developing speech corpus
Suyanto
TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3, 2007, : 475 - 478
[34] PROTOCOL AND BASELINE FOR EXPERIMENTS ON BOGAZICI UNIVERSITY TURKISH EMOTIONAL SPEECH CORPUS
Kaya, Heysem
Salah, Albert Ali
Gurgen, Sadik Fikret
Ekenel, Hazim
2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1698 - 1701
[35] JVNV: A Corpus of Japanese Emotional Speech With Verbal Content and Nonverbal Expressions
Xin, Detai
Jiang, Junfeng
Takamichi, Shinnosuke
Saito, Yuki
Aizawa, Akiko
Saruwatari, Hiroshi
IEEE ACCESS, 2024, 12 : 19752 - 19764
[36] DESCU: Dyadic emotional speech corpus and recognition system for Urdu language
Qasim, Muhammad
Habib, Tania
Urooj, Saba
Mumtaz, Benazir
SPEECH COMMUNICATION, 2023, 148 : 40 - 52
[37] An Open Source Emotional Speech Corpus for Human Robot Interaction Applications
James, Jesin
Tian, Li
Watson, Catherine Inez
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2768 - 2772
[38] An Analysis of Malay Language Emotional Speech Corpus for Emotion Recognition System
Apandi, Nurfarihah
Jamil, Nursuriati
2016 IEEE INDUSTRIAL ELECTRONICS AND APPLICATIONS CONFERENCE (IEACON), 2016, : 225 - 231
[39] LOTUS-BI: a Thai-English Code-mixing Speech Corpus
Thatphithakkul, Sumonmas
Chunwijitra, Vataya
Sertsi, Phuttapong
Chootrakool, Patcharika
Kasuriya, Sawit
2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 40 - 44
[40] Thai Speech Synthesis with emotional tone Based on Formant Synthesis for Home Robot
Khorinphan, Chaiyong
Phansamdaeng, Sukanya
Saiyod, Saiyan
2014 THIRD ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2014, : 111 - 114

← 1 2 3 4 5 →