A Segment-Based Non-Parametric Approach for Monophone Recognition

被引:0
|
作者
Golipour, Ladan [1 ]
O'Shaughnessy, Douglas [1 ]
机构
[1] INRS EMT, Montreal, PQ, Canada
关键词
phoneme recognition; nonparametric density estimation; phoneme segmentation; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a segment-based non-parametric method of monophone recognition. We pre-segment the speech utterance into its underlying phonemes using a group-delay-based algorithm. Then, we apply the k-NN/SASH phoneme classification technique to classify the hypothesized phonemes. Since phoneme boundaries are already known during the decoding, the search space is very limited and the recognition fast. However, such hard-decisioning leads to missed boundaries and over-segmentations. Therefore, while constructing the graph for an utterance, we use phoneme duration constraints and broad-class similarity information to merge or split the segments and create new branches. We perform a simplified acoustical level monophone recognition task on the TIMIT test database. Since phoneme transitional probabilities are not included, only one (most likely) hypothesis and score is provided for each segment and a simple shortest path search algorithm is applied to find the best phoneme sequence rather than the Viterbi search. This simplified evaluation achieves 58.5% accuracy and 67.8% correctness.
引用
收藏
页码:2334 / 2337
页数:4
相关论文
共 50 条
  • [1] Segment-based approach to the recognition of emotions in speech
    Shami, MT
    Kamel, MS
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 366 - 369
  • [2] Industry, corporate, and segment effects and business performance: A non-parametric approach
    Ruefli, TW
    Wiggins, RR
    STRATEGIC MANAGEMENT JOURNAL, 2003, 24 (09) : 861 - 879
  • [3] Segment and combine approach for non-parametric time-series classification
    Geurts, P
    Wehenkel, L
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 478 - 485
  • [4] A NON-PARAMETRIC ANALYSIS OF RECOGNITION EXPERIMENTS
    POLLACK, I
    NORMAN, DA
    PSYCHONOMIC SCIENCE, 1964, 1 (05): : 125 - 126
  • [5] AN ALGORITHM FOR NON-PARAMETRIC PATTERN RECOGNITION
    SEBESTYEN, G
    EDIE, J
    IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1966, EC15 (06): : 908 - +
  • [6] A probabilistic framework for segment-based speech recognition
    Glass, JR
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (2-3): : 137 - 152
  • [7] Structural sensitivity analysis based on a hybrid parametric and non-parametric approach
    Mengus, D.
    Ouisse, M.
    Cogan, S.
    Lefebvre, X.
    PROCEEDINGS OF ISMA 2008: INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING, VOLS. 1-8, 2008, : 3911 - 3925
  • [8] Timing Levels in Segment-Based Speech Emotion Recognition
    Schuller, Bjoern
    Rigoll, Gerhard
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1818 - 1821
  • [9] AN EFFICIENT NON-PARAMETRIC ANALYSIS OF RECOGNITION MEMORY
    POLLACK, I
    NORMAN, DA
    GALANTER, E
    PSYCHONOMIC SCIENCE, 1964, 1 (11): : 327 - 328
  • [10] A non-parametric approach to simplicity clustering
    Hines, Peter
    Pothos, Emmanuel M.
    Chater, Nick
    APPLIED ARTIFICIAL INTELLIGENCE, 2007, 21 (08) : 729 - 752