Developing a Thai emotional speech corpus from Lakorn (EMOLA)

被引:0
|
作者
Sawit Kasuriya
Thanaruk Theeramunkong
Chai Wutiwiwatchai
Piyawat Sukhummek
机构
[1] Thammasat University,School of Information, Computer and Communication Technologies, Sirindhorn International Institute of Technology
[2] Thammasat University,School of Information, Computer and Communication Technologies, Sirindhorn International Institute of Technology
[3] National Electronics and Computer Technology Center (NECTEC),Academy of Science
[4] Royal Society of Thailand,undefined
来源
关键词
Simulated emotional speech corpus; Thai emotional speech corpus; Pleasure-Arousal-Dominance emotional state model; Collection-level; Annotator-oriented; Actor-oriented statistics;
D O I
暂无
中图分类号
学科分类号
摘要
Advances in emotional speech recognition and synthesis essentially rely on the availability of annotated emotional speech corpora. As a low resource language, the Thai language critically lacks corpora of emotional speech, although a few corpora have been constructed for speech recognition and synthesis. This paper presents the design of a Thai emotional speech corpus (namely EMOLA), its construction and annotation process, and its analysis. In the corpus design, four basic types with twelve subtypes of emotions are defined with consideration of the Pleasure-Arousal-Dominance emotional state model. To construct the corpus, a series of Thai dramas (1397 min) were selected and its video clips of approximately 868 min were annotated. As a result, 8987 transcriptions (of conversation turns) were derived in total, with each transcription tagged as one basic type and a few subtypes. Finally, an analysis was conducted to describe the characteristics of this corpus in three sets of statistics: collection-level, annotator-oriented and actor-oriented statistics.
引用
收藏
页码:17 / 55
页数:38
相关论文
共 50 条
  • [1] Developing a Thai emotional speech corpus from Lakorn (EMOLA)
    Kasuriya, Sawit
    Theeramunkong, Thanaruk
    Wutiwiwatchai, Chai
    Sukhummek, Piyawat
    LANGUAGE RESOURCES AND EVALUATION, 2019, 53 (01) : 17 - 55
  • [2] DEVELOPING A THAI EMOTIONAL SPEECH CORPUS
    Kasuriya, Sawit
    Teeramunkong, Thanaruk
    Wutiwiwatchai, Chai
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [3] Developing Tamil Emotional Speech Corpus and Evaluating using SVM
    Joe, C. Vijesh
    2014 International Conference on Science Engineering and Management Research (ICSEMR), 2014,
  • [4] Satja: Thai Elderly Speech Corpus for Speech Recognition
    Prajongjai, Suphunnee
    Triyason, Tuul
    Mongkolnam, Pornchai
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION TECHNOLOGY (IAIT2018), 2018,
  • [5] SUST Bangla Emotional Speech Corpus (SUBESCO): An audio-only emotional speech corpus for Bangla
    Sultana, Sadia
    Rahman, M. Shahidur
    Selim, M. Reza
    Iqbal, M. Zafar
    PLOS ONE, 2021, 16 (04):
  • [6] Emotional Speech Corpus of Croatian Language
    Dropuljic, Branimir
    Chmura, Milosz Thomasz
    Kolak, Antonio
    Petrinovic, Davor
    PROCEEDINGS OF THE 7TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2011), 2011, : 95 - 100
  • [7] EmoChildRu: Emotional Child Russian Speech Corpus
    Lyakso, Elena
    Frolova, Olga
    Dmitrieva, Evgeniya
    Grigorev, Aleksey
    Kaya, Heysem
    Salah, Albert Ali
    Karpov, Alexey
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 144 - 152
  • [8] Emotional Speech Corpus for Persuasive Dialogue System
    Asai, Sara
    Yoshino, Koichiro
    Shinagawa, Seitaro
    Sakti, Sakriani
    Nakamura, Satoshi
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 491 - 497
  • [9] EMOVO Corpus: an Italian Emotional Speech Database
    Costantini, Giovanni
    Iadarola, Iacopo
    Paoloni, Andrea
    Todisco, Massimiliano
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3501 - 3504
  • [10] A Cross-Corpus Recognition of Emotional Speech
    Xiao, Zhongzhe
    Wu, Di
    Zhang, Xiaojun
    Tao, Zhi
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 42 - 46