High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

被引:1
|
作者
Jiang Zhipeng [1 ]
Huang Chengwei [2 ]
机构
[1] Jinling Inst Technol, Sch Elect & Informat Engn, Nanjing, Jiangsu, Peoples R China
[2] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China
关键词
High-order Markov random fields; speech emotion recognition; cross-database recognition; dimensional emotion model;
D O I
10.1515/cait-2015-0054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.
引用
收藏
页码:50 / 57
页数:8
相关论文
共 50 条
  • [11] Cross-language Transfer Speech Recognition using Deep Learning
    Zhao, Yue
    Xu, Yan M.
    Sun, Mei J.
    Xu, Xiao N.
    Wang, Hui
    Yang, Guo S.
    Ji, Qiang
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 1422 - 1426
  • [12] Cross-language use of acoustic information for automatic speech recognition
    Nieuwoudt, C
    Botha, EC
    SPEECH COMMUNICATION, 2002, 38 (1-2) : 101 - 113
  • [13] Cross-language adaptation of acoustic models in automatic speech recognition
    Univ of Pretoria, Pretoria, South Africa
    IEEE AFRICON Conf, (181-184):
  • [14] High-order Markov Random Fields-Based Compressed Sensing for Multispectral Reconstruction
    Huang, Yukun
    Wei, Jingbo
    Yue, Shasha
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 7208 - 7211
  • [15] Cross-Language Speech Emotion Recognition Via Multiple Kernel Learning
    Zha, Cheng
    2019 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2019, : 208 - 209
  • [16] Cross-language analysis of stuttered speech
    Rezaei-Aghbash, N
    Whiteside, SP
    Cudd, P
    JOURNAL OF FLUENCY DISORDERS, 2000, 25 (03) : 248 - 249
  • [17] Chinese-English bilingual phone modeling for cross-language speech recognition
    Yu, SM
    Zhang, SW
    Xu, B
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 917 - 920
  • [18] Correlation properties of the random linear high-order Markov chains
    Vekslerchik, V. E.
    Pritula, G. M.
    Melnik, S. S.
    Usatenko, O., V
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 528
  • [19] Automatic inference of articulated spine models in CT images using high-order Markov Random Fields
    Kadoury, Samuel
    Labelle, Hubert
    Paragios, Nikos
    MEDICAL IMAGE ANALYSIS, 2011, 15 (04) : 426 - 437
  • [20] Models of dataset size, question design, and cross-language speech perception for speech crowdsourcing applications
    Hasegawa-Johnson, Mark
    Cole, Jennifer
    Jyothi, Preethi
    Varshney, Lav R.
    LABORATORY PHONOLOGY, 2015, 6 (3-4): : 381 - 431