High-Order Markov Random Fields and Their Applications in Cross-Language Speech Recognition

被引：1

作者：

Jiang Zhipeng ^{[1
]}

Huang Chengwei ^{[2
]}

机构：

[1] Jinling Inst Technol, Sch Elect & Informat Engn, Nanjing, Jiangsu, Peoples R China

[2] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China

来源：

CYBERNETICS AND INFORMATION TECHNOLOGIES | 2015年 / 15卷 / 04期

关键词：

High-order Markov random fields; speech emotion recognition; cross-database recognition; dimensional emotion model;

D O I：

10.1515/cait-2015-0054

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper we study the cross-language speech emotion recognition using high-order Markov random fields, especially the application in Vietnamese speech emotion recognition. First, we extract the basic speech features including pitch frequency, formant frequency and short-term intensity. Based on the low level descriptor we further construct the statistic features including maximum, minimum, mean and standard deviation. Second, we adopt the high-order Markov random fields (MRF) to optimize the cross-language speech emotion model. The dimensional restrictions may be modeled by MRF. Third, based on the Vietnamese and Chinese database we analyze the efficiency of our emotion recognition system. We adopt the dimensional emotion model (arousal-valence) to verify the efficiency of MRF configuration method. The experimental results show that the high-order Markov random fields can improve the dimensional emotion recognition in the cross-language experiments, and the configuration method shows promising robustness over different languages.

引用

页码：50 / 57

页数：8

共 50 条

[21] A CROSS-LANGUAGE PERSPECTIVE ON SPEECH INFORMATION RATE
Pellegrino, Francois
Coupe, Christophe
Marsico, Egidio
LANGUAGE, 2011, 87 (03) : 539 - 558
[22] CROSS-LANGUAGE STUDY OF SPEECH PATTERN LEARNING
SIMON, C
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 61 : S64 - S64
[23] Cross-language Speech Attribute Detection and Phone Recognition for Tibetan Using Deep Learning
Wang, Hui
Zhao, Yue
Xu, Yanmin
Xu, Xiaona
Suo, Xingmei
Ji, Qiang
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 474 - +
[24] Shared Speech Attribute Augmentation for English-Tibetan Cross-language Phone Recognition
Zhao, Yue
Zhou, Nan
Zhang, Libing
Wu, Licheng
Zheng, Rui
Wang, Xiaoyang
Ji, Qiang
2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 539 - 543
[25] Linguistic disparities in cross-language automatic speech recognition transfer from Arabic to Tashlhiyt
Zellou, Georgia
Lahrouchi, Mohamed
SCIENTIFIC REPORTS, 2024, 14 (01)
[26] Linguistic disparities in cross-language automatic speech recognition transfer from Arabic to Tashlhiyt
Georgia Zellou
Mohamed Lahrouchi
Scientific Reports, 14
[27] Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition
Latif, Siddique
Rana, Rajib
Khalifa, Sara
Jurdak, Raja
Schuller, Bjorn
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1912 - 1926
[28] Onboard Contextual Classification of 3-D Point Clouds with Learned High-order Markov Random Fields
Munoz, Daniel
Vandapel, Nicolas
Hebert, Martial
ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 4273 - 4280
[29] Cross-Language Activation Begins During Speech Planning and Extends Into Second Language Speech
Jacobs, April
Fricke, Melinda
Kroll, Judith F.
LANGUAGE LEARNING, 2016, 66 (02) : 324 - 353
[30] A PARALLEL-PROCESSING ALGORITHM FOR SPEECH RECOGNITION USING MARKOV RANDOM-FIELDS
NODA, H
SHIRAZI, MN
ZHANG, B
SYSTEMS AND COMPUTERS IN JAPAN, 1994, 25 (02) : 92 - 100

← 1 2 3 4 5 →