Investigating Graph-based Features for Speech Emotion Recognition

被引:3
|
作者
Pentari, Anastasia [1 ]
Kafentzis, George [2 ]
Tsiknakis, Manolis [3 ,4 ]
机构
[1] Fdn Res & Technol Hellas, Computat BioMed Lab, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Iraklion, Greece
[3] Hellen Mediterranean Univ, Biomed Informat & eHlth, Dept Elect & Comp Engn, Iraklion, Greece
[4] Inst Comp Sci, Iraklion, Greece
基金
欧盟地平线“2020”;
关键词
Affective Computing; Emotion Recognition; Speech Analysis; Visibility Graph Theory; Graph-based Features; FREQUENCY-ANALYSIS;
D O I
10.1109/BHI56158.2022.9926795
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
During the last decades, automatic speech emotion recognition (SER) has gained an increased interest by the research community. Specifically, SER aims to recognize the emotional state of a speaker directly from a speech recording. The most prominent approaches in the literature include feature extraction of speech signals in time and/or frequency domain that are successively applied as input into a classification scheme. In this paper, we propose to exploit graph theory and structures as alternative forms of speech representations. We suggest applying the so-called Visibility Graph (VG) theory to represent speech data using an adjacency matrix and extract well-known graph-based features from the latter. Finally, these features are fed into a Support Vector Machine (SVM) classifier in a leave-one-speaker-out, multi-class fashion. Our proposed feature set is compared with a well-known acoustic feature set named the Geneva Minimalistic Acoustic Parameter Set (GeMAPS). We test both approaches on two publicly available speech datasets: SAVEE and EMOVO. The experimental results show that the proposed graph-based features provide better results, namely a classification accuracy of 70% and 98%, respectively, yielding an increase by 29.2% and 60.6%, respectively, when compared to GeMAPS.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Speech emotion recognition via graph-based representations
    Pentari, Anastasia
    Kafentzis, George
    Tsiknakis, Manolis
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [2] Speech emotion recognition via graph-based representations
    Anastasia Pentari
    George Kafentzis
    Manolis Tsiknakis
    Scientific Reports, 14
  • [3] Graph-Based Multi-Feature Fusion Method for Speech Emotion Recognition
    Liu, Xueyu
    Lin, Jie
    Wang, Chao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (16)
  • [4] GRAPH-BASED RECOGNITION OF MORPHOLOGICAL FEATURES
    GAVANKAR, P
    JOURNAL OF INTELLIGENT MANUFACTURING, 1993, 4 (03) : 209 - 218
  • [5] Energy Efficient Graph-Based Hybrid Learning for Speech Emotion Recognition on Humanoid Robot
    Wu, Haowen
    Xu, Hanyue
    Seng, Kah Phooi
    Chen, Jieli
    Ang, Li Minn
    ELECTRONICS, 2024, 13 (06)
  • [6] Graph-based matching for recognition of machined features
    Zhang, L.
    Liu, X.
    Jixie Kexue Yu Jishu/Mechanical Science and Technology, 2001, 20 (06): : 929 - 930
  • [7] New graph-based features for shape recognition
    Mirehi, Narges
    Tahmasbi, Maryam
    Targhi, Alireza Tavakoli
    SOFT COMPUTING, 2021, 25 (11) : 7577 - 7592
  • [8] New graph-based features for shape recognition
    Narges Mirehi
    Maryam Tahmasbi
    Alireza Tavakoli Targhi
    Soft Computing, 2021, 25 : 7577 - 7592
  • [9] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [10] Graph-Based Object Semantic Refinement for Visual Emotion Recognition
    Zhang, Jing
    Liu, Xinyu
    Wang, Zhe
    Yang, Hai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3036 - 3049