Evaluation and Optimization of Perceptually-Based ASR Front-End

被引:11
|
作者
Junqua, Jean-Claude [1 ]
Wakita, Hisashi [2 ]
Hermansky, Hynek [2 ]
机构
[1] Matsushita Elect Ind Co Ltd, Informat Sci Lab, Cent Res Labs, Osaka 570, Japan
[2] Div Panasonic Technol Inc, Speech Technol Lab, Santa Barbara, CA 93105 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1993年 / 1卷 / 01期
关键词
D O I
10.1109/89.221366
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several recently proposed automatic speech recognition (ASR) front-ends are experimentally compared in speaker-dependent, speaker-independent (or cross-speaker) recognition. The perceptually-based linear predictive (PLP) front-end, with the root-power sums (RPS) distance measure, yields generally the highest accuracies, especially in cross-speaker recognition. It is experimentally shown that we can optimize the system and further improve recognition accuracy for speaker-independent recognition by controlling the distance measure's sensitivity to spectral peaks and the spectral tilt and by utilizing the speech dynamic features. For a digit vocabulary, and five reference templates obtained with a clustering algorithm, the optimization improves recognition accuracy from 97% to 98.1%, with respect to the PLP_RPS front-end.
引用
收藏
页码:39 / 48
页数:10
相关论文
共 50 条
  • [41] A perceptually-based texture caching algorithm for hardware-based rendering
    Dumont, R
    Pellacini, F
    Ferwerda, JA
    RENDERING TECHNIQUES 2001, 2001, : 249 - +
  • [42] Gamut Mapping in Cinematography Through Perceptually-Based Contrast Modification
    Zamir, Syed Waqas
    Vazquez-Corral, Javier
    Bertalmio, Marcelo
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (03) : 490 - 503
  • [43] FRONT-END PROCESSORS
    STIEFEL, ML
    MINI-MICRO SYSTEMS, 1977, 10 (10): : 58 - &
  • [44] Front-end thinkers
    Carreira Zafra, Cintia
    BORDON-REVISTA DE PEDAGOGIA, 2020, 72 (03): : 176 - 178
  • [45] FRONT-END ALIGNMENT
    FELDMAN, L
    AUDIO, 1969, 53 (05): : 30 - &
  • [46] Parallelism in the front-end
    Oberoi, PS
    Sohi, GS
    30TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2003, : 230 - 240
  • [47] RF Front-end Based on Microwave Photonics
    Zhu, Dan
    Chen, Wenjuan
    Chen, Zhiwen
    Du, Tianhua
    Tang, Zhenzhou
    Pan, Shilong
    2017 OPTO-ELECTRONICS AND COMMUNICATIONS CONFERENCE (OECC) AND PHOTONICS GLOBAL CONFERENCE (PGC), 2017,
  • [48] Bottleneck Based Front-End for Diarization Systems
    Vinals, Ignacio
    Villalba, Jesus
    Ortega, Alfonso
    Miguel, Antonio
    Lleida, Eduardo
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 276 - 286
  • [49] A fast approach for perceptually-based fitting strokes into elliptical arcs
    Pedro Company
    Raquel Plumed
    Peter A. C. Varley
    The Visual Computer, 2015, 31 : 775 - 785
  • [50] 'FRONT-END LOADER'
    SALOM, P
    WESTERLY, 1988, 33 (02): : 101 - 101