Evaluation and Optimization of Perceptually-Based ASR Front-End

Cited by: 11
Authors
Junqua, Jean-Claude [1]
Wakita, Hisashi [2]
Hermansky, Hynek [2]
Affiliations
[1] Matsushita Elect Ind Co Ltd, Informat Sci Lab, Cent Res Labs, Osaka 570, Japan
[2] Div Panasonic Technol Inc, Speech Technol Lab, Santa Barbara, CA 93105 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1993, Vol. 1, No. 01
DOI
10.1109/89.221366
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Several recently proposed automatic speech recognition (ASR) front-ends are experimentally compared in speaker-dependent and speaker-independent (or cross-speaker) recognition. The perceptually-based linear predictive (PLP) front-end, with the root-power sums (RPS) distance measure, generally yields the highest accuracies, especially in cross-speaker recognition. Experiments show that the system can be optimized, and speaker-independent recognition accuracy further improved, by controlling the distance measure's sensitivity to spectral peaks and spectral tilt and by using dynamic speech features. For a digit vocabulary with five reference templates obtained by a clustering algorithm, the optimization improves recognition accuracy from 97% to 98.1% compared with the PLP_RPS front-end.
Pages: 39-48
Page count: 10