Emotion Recognition for Instantaneous Marathi Spoken Words

被引:3
|
作者
Kamble, Vaibhav V. [1 ]
Deshmukh, Ratnadeep R. [2 ]
Karwankar, Anil R. [1 ]
Ratnaparkhe, Varsha R. [1 ]
Annadate, Suresh A. [3 ]
机构
[1] Govt Coll Engn, Dept Elect & Telecommun, Aurangabad, Maharashtra, India
[2] Dr Babasaheb Ambedkar Marathwada Univ, Dept Comp Sci & Informat Technol, Aurangabad, Maharashtra, India
[3] Jawaharlal Nehru Engn Coll, Dept Elect & Telecommun, Aurangabad, Maharashtra, India
关键词
Emotion Recognition; Mel Frequency Cepstral Coefficient; Gaussian mixture models; speaking rate; Marathi Speech Emotional Database;
D O I
10.1007/978-3-319-12012-6_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explore on emotion recognition from Marathi speech signals by using feature extraction techniques and classifier to classify Marathi speech utterances according to their emotional contains. A different type of speech feature vectors contains different emotions, due to their corresponding natures. In this we have categorized the emotions as namely Anger, Happy, Sad, Fear, Neutral and Surprise. Mel Frequency Cepstral Coefficient (MFCC) feature parameters extracted from Marathi speech Signals depend on speaker, spoken word as well as emotion. Gaussian mixture Models (GMM) is used to develop Emotion classification model. In this, recently proposed feature extraction technique and classifier is used for Marathi spoken words. In this each subject/Speaker has spoken 7 Marathi words with 6 different emotions that 7 Marathi words are Aathawan, Aayusha, Chamakdar, Iishara, Manav, Namaskar, and Uupay. For experimental work we have created total 924 Marathi speech utterances database and from this we achieved the empirical performance of overall emotion recognition accuracy rate obtained using MFCC and GMM is 84.61% rate of our Emotion Recognition for Marathi Spoken Words (ERFMSW) system. We got average accuracy for male and female is 86.20% and 83.03% respectively.
引用
收藏
页码:335 / 346
页数:12
相关论文
共 50 条
  • [31] Automatic recognition of spoken words via wavelets coding
    Karam, J
    CISST'03: PROCEEDING OF THE INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS AND TECHNOLOGY, VOLS 1 AND 2, 2003, : 53 - 56
  • [32] Recognition of spoken words: Semantic effects in lexical access
    Wurm, LH
    Vakoch, DA
    Seaman, SR
    LANGUAGE AND SPEECH, 2004, 47 : 175 - 204
  • [33] Adaptive Artificial Neural Network Based Marathi Speech Database Emotion Recognition
    Palange, Lalita Anil
    Darekar, Raviraj Vishwambhar
    TECHNO-SOCIETAL 2018: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SOCIETAL APPLICATIONS - VOL 2, 2020, : 59 - 67
  • [34] ONLINE RECOGNITION OF SPOKEN WORDS FROM A LARGE VOCABULARY
    KOHONEN, T
    RIITTINEN, H
    REUHKALA, E
    HALTSONEN, S
    INFORMATION SCIENCES, 1984, 33 (1-2) : 3 - 30
  • [35] Dimensionality reduction-based spoken emotion recognition
    Shiqing Zhang
    Xiaoming Zhao
    Multimedia Tools and Applications, 2013, 63 : 615 - 646
  • [36] DEEP MULTIMODAL LEARNING FOR EMOTION RECOGNITION IN SPOKEN LANGUAGE
    Gu, Yue
    Chen, Shuhong
    Marsic, Ivan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5079 - 5083
  • [37] An emotion space model for recognition of emotions in spoken Chinese
    Jin, XC
    Wang, ZF
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 397 - 402
  • [38] Dimensionality reduction-based spoken emotion recognition
    Zhang, Shiqing
    Zhao, Xiaoming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 63 (03) : 615 - 646
  • [39] Improving speech emotion recognition based on acoustic words emotion dictionary
    Wei, Wang
    Cao, Xinyi
    Li, He
    Shen, Lingjie
    Feng, Yaqin
    Watters, Paul A.
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (06) : 747 - 761
  • [40] The ERP repetition effects on spoken and written Chinese words recognition
    Luo, YJ
    Weng, XC
    Wei, JH
    NEUROIMAGE, 2001, 13 (06) : S562 - S562