ShEMO: a large-scale validated database for Persian speech emotion detection

Cited by: 35
Authors
Nezami, Omid Mohamad [1]
Lou, Paria Jamshid [2]
Karami, Mansoureh [2]
Affiliations
[1] Islamic Azad Univ, Bijar Branch, Bijar, Iran
[2] Sharif Univ Technol, Tehran, Iran
Keywords
Emotional speech; Speech database; Emotion detection; Benchmark; Persian; RECOGNITION; MODEL; AGREEMENT; VALENCE; AROUSAL;
DOI
10.1007/s10579-018-9427-x
Chinese Library Classification
TP39 [Computer applications];
Subject classification codes
081203 ; 0835 ;
Abstract
This paper introduces the Sharif Emotional Speech Database (ShEMO), a large-scale, validated database of Persian emotional speech. The database includes 3000 semi-natural utterances, equivalent to 3 h 25 min of speech data, extracted from online radio plays. ShEMO covers speech samples from 87 native Persian speakers for five basic emotions (anger, fear, happiness, sadness and surprise) as well as the neutral state. Twelve annotators label the underlying emotional state of each utterance, and majority voting is used to decide the final labels. According to the kappa measure, the inter-annotator agreement is 64%, which is interpreted as substantial agreement. We also present benchmark results based on common classification methods for the speech emotion detection task. In these experiments, a support vector machine achieves the best results for both the gender-independent model (58.2%) and the gender-dependent models (female = 59.4%, male = 57.6%). ShEMO will be made available free of charge for academic purposes to provide a baseline for further research on Persian emotional speech.
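The labeling procedure described in the abstract (several annotators per utterance, majority voting, agreement measured with kappa) can be illustrated with a minimal sketch. The abstract does not say which kappa variant was used or how ties were handled, so the sketch assumes Fleiss' kappa (a common choice for more than two raters) and treats a tie as "no label"; the function names `majority_label` and `fleiss_kappa` and the toy ratings are hypothetical and are not taken from the paper or from the ShEMO distribution.

```python
from collections import Counter


def majority_label(labels):
    """Return the most frequent label, or None if first place is tied.

    Tie handling is an assumption; the abstract does not describe it.
    """
    ranked = Counter(labels).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def fleiss_kappa(ratings):
    """Fleiss' kappa for a list of items, each rated by the same number of raters.

    `ratings` is a list of lists of category labels, one inner list per item.
    Assumed variant: the source only says "the kappa measure".
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if any(len(item) != n_raters for item in ratings):
        raise ValueError("every item needs the same number of ratings")

    totals = Counter()      # how often each category was assigned overall
    agreement_sum = 0.0     # sum of per-item agreement P_i
    for item in ratings:
        counts = Counter(item)
        totals.update(counts)
        pairs_agreeing = sum(c * c for c in counts.values()) - n_raters
        agreement_sum += pairs_agreeing / (n_raters * (n_raters - 1))

    p_observed = agreement_sum / n_items
    p_expected = sum((c / (n_items * n_raters)) ** 2 for c in totals.values())
    if p_expected == 1.0:
        # Only one category was ever used; kappa is undefined in that case.
        return float("nan")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Toy data: two utterances, three hypothetical annotators each.
toy_ratings = [
    ["anger", "anger", "neutral"],
    ["sadness", "sadness", "sadness"],
]
final_labels = [majority_label(item) for item in toy_ratings]
kappa = fleiss_kappa(toy_ratings)
```

On real data, each inner list would hold the twelve annotator labels for one utterance, and utterances whose vote is tied would need a separate resolution rule.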
Pages: 1-16
Page count: 16