Quality Dimensions of Narrowband and Wideband Speech Transmission

被引:29
|
作者
Waeltermann, M. [1 ]
Raake, A. [1 ]
Moeller, S. [1 ]
机构
[1] Berlin Inst Technol, Deutsch Telekom Labs, Qual & Usabil Lab, Berlin, Germany
关键词
INDIVIDUAL-DIFFERENCES; IMPAIRMENT FACTOR; NOISE;
D O I
10.3813/AAA.918370
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The study presented in this paper aims at exploring the perceptual spaces evoked for users of two different telephone scenarios: traditional narrowband speech transmission, and mixed narrowband/wideband speech transmission that may be encountered in today's Voice-over-IP services. Underlying dimensions that constitute the skeleton of these spaces are revealed by auditory experiments, following two different paradigms of judgment: a) Similarity-scaling, and b) Attribute-scaling (Semantic Differential) with subsequent a) Multidimensional Scaling, and b) Principal Component Analysis of a diverse set of stimuli. Similar configurations are obtained which are unequivocally interpretable. Three common dimensions, valid for both the narrowband and the wideband scenario can be identified: "Discontinuity", "Noisiness", and "Coloration". In addition, the wideband space is extended by a further, wideband-specific dimension. Integral listening-quality can well be modeled by means of these dimensions. In both scenarios, "Discontinuity" represents the most important quality feature. The presented work forms the basis for instrumental diagnostic quality measures.
引用
收藏
页码:1090 / 1103
页数:14
相关论文
共 50 条
  • [41] Predicting the quality of enhanced wideband speech with a cochlear model
    Bruce, Ian C. (ibruce@ieee.org), 1600, Acoustical Society of America (142):
  • [42] Quality prediction of synthesized speech based on perceptual quality dimensions
    Norrenbrock, Christoph R.
    Hinterleitner, Florian
    Heute, Ulrich
    Moeller, Sebastian
    SPEECH COMMUNICATION, 2015, 66 : 17 - 35
  • [43] An upper bound on the quality of artificial bandwidth extension of narrowband speech signals
    Jax, P
    Vary, P
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 237 - 240
  • [44] High quality sinusoidal modeling of wideband speech for the purposes of speech synthesis and modification
    Chazan, Dan
    Hoory, Ron
    Sagi, Ariel
    Shechtman, Slava
    Sorin, Alex
    Shuang, Zhi Wei
    Bakis, Raimo
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 877 - 880
  • [45] SPEECH QUALITY OF PCM TRANSMISSION SYSTEM
    HASHIMOTO, K
    SAITO, S
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1969, 52 (08): : 20 - +
  • [46] An Integrated Narrowband-Wideband Antenna
    Aly, Mostafa G.
    Wang, Yi
    2013 LOUGHBOROUGH ANTENNAS AND PROPAGATION CONFERENCE (LAPC), 2013, : 433 - 435
  • [47] Narrowband channel extraction for wideband receivers
    Welborn, ML
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 1401 - 1404
  • [48] BLIND ESTIMATION OF THE SPEECH TRANSMISSION INDEX FOR SPEECH QUALITY PREDICTION
    Seetharaman, Prem
    Mysore, Gautham J.
    Smaragdis, Paris
    Pardo, Bryan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 591 - 595
  • [49] Narrowband channel extraction for wideband receivers
    Welborn, Matthew L.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 3 : 1401 - 1404
  • [50] Non-intrusive objective speech quality measurement based on fuzzy GMM and SVR for narrowband speech
    Wang, Jing
    Zhang, Ying
    Zhao, Sheng-Hui
    Kuang, Jing-Ming
    Journal of Beijing Institute of Technology (English Edition), 2010, 19 (01): : 76 - 81