Developing a Thai emotional speech corpus from Lakorn (EMOLA)

被引:0
|
作者
Sawit Kasuriya
Thanaruk Theeramunkong
Chai Wutiwiwatchai
Piyawat Sukhummek
机构
[1] Thammasat University,School of Information, Computer and Communication Technologies, Sirindhorn International Institute of Technology
[2] Thammasat University,School of Information, Computer and Communication Technologies, Sirindhorn International Institute of Technology
[3] National Electronics and Computer Technology Center (NECTEC),Academy of Science
[4] Royal Society of Thailand,undefined
来源
关键词
Simulated emotional speech corpus; Thai emotional speech corpus; Pleasure-Arousal-Dominance emotional state model; Collection-level; Annotator-oriented; Actor-oriented statistics;
D O I
暂无
中图分类号
学科分类号
摘要
Advances in emotional speech recognition and synthesis essentially rely on the availability of annotated emotional speech corpora. As a low resource language, the Thai language critically lacks corpora of emotional speech, although a few corpora have been constructed for speech recognition and synthesis. This paper presents the design of a Thai emotional speech corpus (namely EMOLA), its construction and annotation process, and its analysis. In the corpus design, four basic types with twelve subtypes of emotions are defined with consideration of the Pleasure-Arousal-Dominance emotional state model. To construct the corpus, a series of Thai dramas (1397 min) were selected and its video clips of approximately 868 min were annotated. As a result, 8987 transcriptions (of conversation turns) were derived in total, with each transcription tagged as one basic type and a few subtypes. Finally, an analysis was conducted to describe the characteristics of this corpus in three sets of statistics: collection-level, annotator-oriented and actor-oriented statistics.
引用
收藏
页码:17 / 55
页数:38
相关论文
共 50 条
  • [31] JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
    Xin, Detai
    Jiang, Junfeng
    Takamichi, Shinnosuke
    Saito, Yuki
    Aizawa, Akiko
    Saruwatari, Hiroshi
    arXiv, 2023,
  • [32] Emotional Speech Recognition Using Rhythm Metrics and a New Arabic Corpus
    Meftah, Ali H.
    Qamhan, Mustafa
    Alotaibi, Yousef
    Selouani, Sid-Ahmed
    2020 16TH IEEE INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2020), 2020, : 57 - 62
  • [33] Signal energy-based Automatic Speech Splitter: A tool for developing speech corpus
    Suyanto
    TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3, 2007, : 475 - 478
  • [34] PROTOCOL AND BASELINE FOR EXPERIMENTS ON BOGAZICI UNIVERSITY TURKISH EMOTIONAL SPEECH CORPUS
    Kaya, Heysem
    Salah, Albert Ali
    Gurgen, Sadik Fikret
    Ekenel, Hazim
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1698 - 1701
  • [35] JVNV: A Corpus of Japanese Emotional Speech With Verbal Content and Nonverbal Expressions
    Xin, Detai
    Jiang, Junfeng
    Takamichi, Shinnosuke
    Saito, Yuki
    Aizawa, Akiko
    Saruwatari, Hiroshi
    IEEE ACCESS, 2024, 12 : 19752 - 19764
  • [36] DESCU: Dyadic emotional speech corpus and recognition system for Urdu language
    Qasim, Muhammad
    Habib, Tania
    Urooj, Saba
    Mumtaz, Benazir
    SPEECH COMMUNICATION, 2023, 148 : 40 - 52
  • [37] An Open Source Emotional Speech Corpus for Human Robot Interaction Applications
    James, Jesin
    Tian, Li
    Watson, Catherine Inez
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2768 - 2772
  • [38] An Analysis of Malay Language Emotional Speech Corpus for Emotion Recognition System
    Apandi, Nurfarihah
    Jamil, Nursuriati
    2016 IEEE INDUSTRIAL ELECTRONICS AND APPLICATIONS CONFERENCE (IEACON), 2016, : 225 - 231
  • [39] LOTUS-BI: a Thai-English Code-mixing Speech Corpus
    Thatphithakkul, Sumonmas
    Chunwijitra, Vataya
    Sertsi, Phuttapong
    Chootrakool, Patcharika
    Kasuriya, Sawit
    2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 40 - 44
  • [40] Thai Speech Synthesis with emotional tone Based on Formant Synthesis for Home Robot
    Khorinphan, Chaiyong
    Phansamdaeng, Sukanya
    Saiyod, Saiyan
    2014 THIRD ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2014, : 111 - 114