Evaluation and Optimization of Perceptually-Based ASR Front-End

Cited by: 11
Authors
Junqua, Jean-Claude [1]
Wakita, Hisashi [2]
Hermansky, Hynek [2]
Affiliations
[1] Matsushita Elect Ind Co Ltd, Informat Sci Lab, Cent Res Labs, Osaka 570, Japan
[2] Div Panasonic Technol Inc, Speech Technol Lab, Santa Barbara, CA 93105 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1993, Vol. 1, No. 1
DOI
10.1109/89.221366
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Several recently proposed automatic speech recognition (ASR) front-ends are experimentally compared in speaker-dependent, speaker-independent (or cross-speaker) recognition. The perceptually-based linear predictive (PLP) front-end, with the root-power sums (RPS) distance measure, yields generally the highest accuracies, especially in cross-speaker recognition. It is experimentally shown that we can optimize the system and further improve recognition accuracy for speaker-independent recognition by controlling the distance measure's sensitivity to spectral peaks and the spectral tilt and by utilizing the speech dynamic features. For a digit vocabulary, and five reference templates obtained with a clustering algorithm, the optimization improves recognition accuracy from 97% to 98.1%, with respect to the PLP_RPS front-end.
Pages: 39-48
Page count: 10