High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

Cited by: 1

Authors:

Jiang Zhipeng [1]

Huang Chengwei [2]

Affiliations:

[1] Jinling Inst Technol, Sch Elect & Informat Engn, Nanjing, Jiangsu, Peoples R China

[2] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China

Source:

CYBERNETICS AND INFORMATION TECHNOLOGIES | 2015 / Vol. 15 / No. 04

Keywords:

High-order Markov random fields; speech emotion recognition; cross-database recognition; dimensional emotion model;

DOI:

10.1515/cait-2015-0054

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper studies cross-language speech emotion recognition using high-order Markov random fields (MRF), with a focus on Vietnamese speech emotion recognition. First, we extract basic speech features, including pitch frequency, formant frequency and short-term intensity. From these low-level descriptors we then construct statistical features, including the maximum, minimum, mean and standard deviation. Second, we adopt high-order MRF to optimize the cross-language speech emotion model; the dimensional restrictions can be modeled by the MRF. Third, we analyze the efficiency of our emotion recognition system on the Vietnamese and Chinese databases. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of the MRF configuration method. The experimental results show that high-order Markov random fields can improve dimensional emotion recognition in the cross-language experiments, and that the configuration method shows promising robustness across languages.
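
The statistical-feature step described in the abstract can be illustrated with a minimal sketch. It assumes the low-level descriptors (pitch, intensity, and so on) are already available as per-frame numeric contours; the function name `statistical_features`, the dictionary keys and the sample values are hypothetical and are not taken from the paper. The MRF stage is not shown.

```python
import numpy as np


def statistical_features(contours):
    """Summarize each frame-level contour by its max, min, mean and std.

    `contours` maps a descriptor name (hypothetical, e.g. "pitch_hz")
    to a sequence of per-frame values for one utterance.
    """
    features = {}
    for name, values in contours.items():
        x = np.asarray(values, dtype=float)
        features[name] = {
            "max": float(x.max()),
            "min": float(x.min()),
            "mean": float(x.mean()),
            # Population standard deviation (NumPy default, ddof=0);
            # the paper does not state which variant it uses.
            "std": float(x.std()),
        }
    return features


# Made-up frame-level values for a single utterance (illustration only).
example = {
    "pitch_hz": [200.0, 220.0, 210.0, 230.0],
    "intensity_db": [60.0, 62.0, 61.0, 65.0],
}
feats = statistical_features(example)
```

Each utterance is thereby reduced to a fixed-length set of statistics per descriptor, which is the kind of input a downstream emotion model would consume.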

Pages: 50-57

Number of pages: 8

50 items in total

[11] Cross-language Transfer Speech Recognition using Deep Learning
Zhao, Yue
Xu, Yan M.
Sun, Mei J.
Xu, Xiao N.
Wang, Hui
Yang, Guo S.
Ji, Qiang
11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 1422 - 1426
[12] Cross-language use of acoustic information for automatic speech recognition
Nieuwoudt, C
Botha, EC
SPEECH COMMUNICATION, 2002, 38 (1-2) : 101 - 113
[13] Cross-language adaptation of acoustic models in automatic speech recognition
Univ of Pretoria, Pretoria, South Africa
IEEE AFRICON Conf, (181-184):
[14] High-order Markov Random Fields-Based Compressed Sensing for Multispectral Reconstruction
Huang, Yukun
Wei, Jingbo
Yue, Shasha
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 7208 - 7211
[15] Cross-Language Speech Emotion Recognition Via Multiple Kernel Learning
Zha, Cheng
2019 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2019, : 208 - 209
[16] Cross-language analysis of stuttered speech
Rezaei-Aghbash, N
Whiteside, SP
Cudd, P
JOURNAL OF FLUENCY DISORDERS, 2000, 25 (03) : 248 - 249
[17] Chinese-English bilingual phone modeling for cross-language speech recognition
Yu, SM
Zhang, SW
Xu, B
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 917 - 920
[18] Correlation properties of the random linear high-order Markov chains
Vekslerchik, V. E.
Pritula, G. M.
Melnik, S. S.
Usatenko, O. V.
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 528
[19] Automatic inference of articulated spine models in CT images using high-order Markov Random Fields
Kadoury, Samuel
Labelle, Hubert
Paragios, Nikos
MEDICAL IMAGE ANALYSIS, 2011, 15 (04) : 426 - 437
[20] Models of dataset size, question design, and cross-language speech perception for speech crowdsourcing applications
Hasegawa-Johnson, Mark
Cole, Jennifer
Jyothi, Preethi
Varshney, Lav R.
LABORATORY PHONOLOGY, 2015, 6 (3-4) : 381 - 431