Nonintrusive speech quality estimation using Gaussian mixture models

Cited by: 25

Authors:

Falk, TH [1]

Chan, WY [1]

Affiliations:

[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada

Source:

IEEE SIGNAL PROCESSING LETTERS | 2006 / Vol. 13 / No. 2

Keywords:

Gaussian mixtures; quality assurance; quality measurement; quality of service; speech coding; speech quality; speech transmission; telephony

DOI:

10.1109/LSP.2005.861598

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronics and Communication Technology]

Discipline classification codes:

0808; 0809

Abstract:

An algorithm for nonintrusive speech quality estimation based on Gaussian mixture models (GMMs) is presented. GMMs are used to form an artificial reference model of the behavior of features of undegraded speech. Consistency measures between the degraded speech signal and the reference model serve as indicators of speech quality. Consistency values are mapped to an objective speech quality score using a multivariate adaptive regression splines function. When tested on unseen data, the proposed algorithm generally outperforms ITU-T standard P.563, which is the current "state-of-the-art" algorithm. The algorithm computes objective quality scores roughly twice as fast as P.563.
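
The reference-model idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes one-dimensional synthetic "features" in place of real speech features, a small hand-written EM fit, and average log-likelihood as the consistency measure. The names fit_gmm_1d, consistency, and clean_feature are hypothetical, and the final mapping to a quality score (multivariate adaptive regression splines in the abstract) is left out.

```python
import math
import random


def gauss_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(data, k=2, iters=50, seed=0):
    """Fit a 1-D Gaussian mixture with EM; returns (weights, means, variances)."""
    rng = random.Random(seed)
    means = rng.sample(data, k)  # initialise means from random data points
    mean_all = sum(data) / len(data)
    var_all = sum((x - mean_all) ** 2 for x in data) / len(data)
    variances = [var_all] * k
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each sample
        resp = []
        for x in data:
            p = [w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            s = sum(p) or 1e-300
            resp.append([pj / s for pj in p])
        # M-step: re-estimate weights, means, and variances
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            weights[j] = nj / len(data)
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, 1e-6)  # floor to avoid collapse
    return weights, means, variances


def consistency(features, gmm):
    """Average log-likelihood of the features under the reference GMM."""
    weights, means, variances = gmm
    total = 0.0
    for x in features:
        p = sum(w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances))
        total += math.log(max(p, 1e-300))
    return total / len(features)


# Toy data (assumption): clean "features" are drawn from a two-mode distribution,
# and degradation is simulated by adding strong Gaussian noise.
rng = random.Random(1)


def clean_feature():
    return rng.gauss(-2.0, 0.7) if rng.random() < 0.5 else rng.gauss(2.0, 0.7)


train = [clean_feature() for _ in range(500)]
reference = fit_gmm_1d(train, k=2)  # artificial reference model of clean features

clean_test = [clean_feature() for _ in range(200)]
degraded_test = [clean_feature() + rng.gauss(0.0, 3.0) for _ in range(200)]

clean_score = consistency(clean_test, reference)
degraded_score = consistency(degraded_test, reference)
print(clean_score > degraded_score)
```

In this toy setup the degraded features are less consistent with the reference model than held-out clean features, which is the property the abstract relies on when it uses consistency values as quality indicators.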


Pages: 108-111

Number of pages: 4

