High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

被引:1
|
作者
Jiang Zhipeng [1 ]
Huang Chengwei [2 ]
机构
[1] Jinling Inst Technol, Sch Elect & Informat Engn, Nanjing, Jiangsu, Peoples R China
[2] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China
关键词
High-order Markov random fields; speech emotion recognition; cross-database recognition; dimensional emotion model;
D O I
10.1515/cait-2015-0054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.
引用
收藏
页码:50 / 57
页数:8
相关论文
共 50 条
  • [31] Comparison of cross-language generalisation following speech therapy
    Holm, A
    Dodd, B
    FOLIA PHONIATRICA ET LOGOPAEDICA, 2001, 53 (03) : 166 - 172
  • [32] Cognitive influences on cross-language speech perception in infancy
    Lalonde, CE
    Werker, JF
    INFANT BEHAVIOR & DEVELOPMENT, 1995, 18 (04): : 459 - 475
  • [33] Cross-language priming: A view from bilingual speech
    Travis, Catherine E.
    Cacoullos, Rena Torres
    Kidd, Evan
    BILINGUALISM-LANGUAGE AND COGNITION, 2017, 20 (02) : 283 - 298
  • [34] DEVELOPMENTAL ASPECTS OF CROSS-LANGUAGE SPEECH-PERCEPTION
    WERKER, JF
    GILBERT, JHV
    HUMPHREY, K
    TEES, RC
    CHILD DEVELOPMENT, 1981, 52 (01) : 349 - 355
  • [35] CROSS-LANGUAGE STUDY OF SPEECH-PATTERN LEARNING
    SIMON, C
    FOURCIN, AJ
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (03): : 925 - 935
  • [36] CROSS-LANGUAGE SPEECH DEPENDENT LIP-SYNCHRONIZATION
    Jha, Abhishek
    Voleti, Vikram
    Namboodiri, Vinay
    Jawahar, C. V.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7140 - 7144
  • [37] Cross-language differences in cue use for speech segmentation
    Tyler, Michael D.
    Cutler, Anne
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (01): : 367 - 376
  • [38] Cross-Language Transfer Lear ning-based Lhasa-Tibetan Speech Recognition
    Wang, Zhijie
    Zhao, Yue
    Wu, Licheng
    Bi, Xiaojun
    Dawa, Zhuoma
    Ji, Qiang
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 629 - 639
  • [39] Cross-language speech retrieval: Establishing a baseline performance
    Sheridan, P
    Wechsler, M
    Schauble, P
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1997, : 99 - 108
  • [40] High-Order Markov Random Field Based Image Registration for Pulmonary CT
    Xue, Peng
    Dong, Enqing
    Ji, Huizhong
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS, MIUA 2019, 2020, 1065 : 339 - 350