Nonintrusive speech quality estimation using Gaussian mixture models

Cited by: 25
Authors
Falk, TH [1]
Chan, WY [1]
Affiliations
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada
Keywords
Gaussian mixtures; quality assurance; quality measurement; quality of service; speech coding; speech quality; speech transmission; telephony
DOI
10.1109/LSP.2005.861598
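Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract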
An algorithm for nonintrusive speech quality estimation based on Gaussian mixture models (GMMs) is presented. GMMs are used to form an artificial reference model of the behavior of features of undegraded speech. Consistency measures between the degraded speech signal and the reference model serve as indicators of speech quality. Consistency values are mapped to an objective speech quality score using a multivariate adaptive regression splines function. When tested on unseen data, the proposed algorithm generally outperforms ITU-T standard P.563, which is the current "state-of-the-art" algorithm. The algorithm computes objective quality scores roughly twice as fast as P.563.
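The idea of scoring a degraded signal by its consistency with a reference model of clean speech can be illustrated with a minimal sketch. The abstract does not specify the features, the model sizes, or the exact consistency measures, so everything concrete here is an assumption: the two-component diagonal-covariance GMM parameters are hypothetical, the "features" are synthetic two-dimensional vectors, and the consistency measure is taken to be the average per-frame log-likelihood. The mapping to a quality score via multivariate adaptive regression splines is not shown.

```python
import math
import random


def diag_gmm_logpdf(x, weights, means, variances):
    """Log-density of one feature vector under a diagonal-covariance GMM."""
    comp_logs = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            ll += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
        comp_logs.append(ll)
    # Log-sum-exp over components for numerical stability.
    m = max(comp_logs)
    return m + math.log(sum(math.exp(c - m) for c in comp_logs))


def consistency(frames, weights, means, variances):
    """Average per-frame log-likelihood (an assumed consistency measure)."""
    total = sum(diag_gmm_logpdf(f, weights, means, variances) for f in frames)
    return total / len(frames)


# Hypothetical reference model of "clean speech" features (illustrative values only).
WEIGHTS = [0.6, 0.4]
MEANS = [[0.0, 0.0], [3.0, 3.0]]
VARIANCES = [[1.0, 1.0], [0.5, 0.5]]

# Synthetic frames: "clean" frames lie near the first component;
# "degraded" frames are the same frames shifted away from both components.
rng = random.Random(0)
clean = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(200)]
degraded = [[a + 5.0, b - 5.0] for a, b in clean]

c_clean = consistency(clean, WEIGHTS, MEANS, VARIANCES)
c_degraded = consistency(degraded, WEIGHTS, MEANS, VARIANCES)
print("clean consistency:   ", c_clean)
print("degraded consistency:", c_degraded)
```

In this toy setup the shifted frames receive a lower average log-likelihood than the unshifted ones, which is the behavior a consistency-based quality indicator relies on. A real system would fit the reference model to features extracted from undegraded speech rather than fixing its parameters by hand.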
Pages: 108-111
Page count: 4
Related papers
50 in total
  • [1] Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation
    Chehrehsa, Sarang
    Moir, Tom James
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 58 - 71
  • [2] Speech Enhancement Using Gaussian Scale Mixture Models
    Hao, Jiucang
    Lee, Te-Won
    Sejnowski, Terrence J.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): 1127 - 1136
  • [3] Waveform quantization of speech using Gaussian mixture models
    Samuelsson, J
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 165 - 168
  • [4] Emotional speech classification using Gaussian mixture models
    Ververidis, D
    Kotropoulos, C
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 2871 - 2874
  • [5] ON NONINTRUSIVE SPEECH QUALITY ESTIMATION FOR HEARING AIDS
    Salehi, Haniyeh
    Parsa, Vijay
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [6] Discriminative estimation of subspace constrained Gaussian mixture models for speech recognition
    Axelrod, Scott
    Goel, Vaibhava
    Gopinath, Ramesh
    Olsen, Peder
    Visweswariah, Karthik
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): 172 - 189
  • [7] Age Approximation from Speech using Gaussian Mixture Models
    Mittal, Tanushri
    Barthwal, Anurag
    Koolagudi, Shashidhar G.
    2013 SECOND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND SECURITY (ADCONS 2013), 2013, : 74 - 78
  • [8] Recognition of Emotions in German Speech Using Gaussian Mixture Models
    Vondra, Martin
    Vich, Robert
    MULTIMODAL SIGNAL: COGNITIVE AND ALGORITHMIC ISSUES, 2009, 5398 : 256 - 263
  • [9] Nonintrusive Speech Quality Estimation Based on Perceptual Linear Prediction
    Salehi, Haniyeh
    Parsa, Vijay
    2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,
  • [10] SUBSPACE GAUSSIAN MIXTURE MODELS FOR SPEECH RECOGNITION
    Povey, Daniel
    Burget, Lukas
    Agarwal, Mohit
    Akyazi, Pinar
    Feng, Kai
    Ghoshal, Arnab
    Glembek, Ondrej
    Goel, Nagendra Kumar
    Karafiat, Martin
    Rastrow, Ariya
    Rose, Richard C.
    Schwarz, Petr
    Thomas, Samuel
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4330 - 4333