High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

被引:1
|
作者
Jiang Zhipeng [1 ]
Huang Chengwei [2 ]
机构
[1] Jinling Inst Technol, Sch Elect & Informat Engn, Nanjing, Jiangsu, Peoples R China
[2] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China
关键词
High-order Markov random fields; speech emotion recognition; cross-database recognition; dimensional emotion model;
D O I
10.1515/cait-2015-0054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.
引用
收藏
页码:50 / 57
页数:8
相关论文
共 50 条
  • [1] A study on high-order hidden Markov models and applications to speech recognition
    Lee, Lee-Min
    Lee, Jia-Chien
    ADVANCES IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4031 : 682 - 690
  • [2] Piecewise polynomial high-order hidden Markov models with applications in speech recognition
    Lee, Lee-Min
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2016, : 323 - 327
  • [3] PIECEWISE LINEAR HIGH-ORDER HIDDEN MARKOV MODELS AND APPLICATIONS TO SPEECH RECOGNITION
    Lee, Lee-Min
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL. 1, 2015, : 383 - 388
  • [4] High-order hidden Markov model for piecewise linear processes and applications to speech recognition
    Lee, Lee-Min
    Jean, Fu-Rong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (02): : EL204 - EL210
  • [5] High-order hidden Markov model for piecewise linear processes and applications to speech recognition
    Lee, Lee-Min
    Jean, Fu-Rong
    Journal of the Acoustical Society of America, 2016, 140 (02):
  • [6] Cross-language speech emotion recognition in German and Chinese
    School of Information Science and Engineering, Southeast University, No. 2, Si Pai Lou, Nanjing 210096, China
    不详
    不详
    Huang, C. (Huang.Chengwei1@gmail.com), 2012, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (06):
  • [7] Cross-language speech emotion recognition in German and Chinese
    Huang, Chengwei
    Han, Dong
    Bao, Yongqiang
    Yu, Hua
    Zhao, Li
    ICIC Express Letters, 2012, 6 (08): : 2141 - 2146
  • [9] Optimal learning high-order Markov random fields priors of colour image
    Zhang, Ke
    Jin, Huidong
    Fu, Zhouyu
    Liu, Nianjun
    COMPUTER VISION - ACCV 2007, PT I, PROCEEDINGS, 2007, 4843 : 482 - 491
  • [10] Texture modelling with nested high-order Markov-Gibbs random fields
    Versteegen, Ralph
    Gimel'farb, Georgy
    Riddle, Patricia
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 143 : 120 - 134