Speech Recognition Using Augmented Conditional Random Fields

被引:45
|
作者
Hifny, Yasser [1 ]
Renals, Steve [2 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland
关键词
Augmented conditional random fields (ACRFs); augmented spaces; discriminative compression; hidden Markov models (HMMs); PHONE RECOGNITION; FEATURES;
D O I
10.1109/TASL.2008.2010286
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although RMMs are a natural choice to warp the time axis and model the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects which have made them successful. In particular, the acoustic modeling problem is reformulated in a data driven, sparse, augmented space to increase discrimination. Acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. In the TIMIT phone recognition task, a phone error rate of 23.0% was recorded on the full test set, a significant improvement over comparable HMM-based systems.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 50 条
  • [1] Active Learning for Speech Emotion Recognition Using Conditional Random Fields
    Zhao, Ziping
    Ma, Xirong
    2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 127 - 131
  • [2] Hidden Conditional Random Fields for Visual Speech Recognition
    Pass, Adrian
    Zhang, Jianguo
    Stewart, Darryl
    2009 13TH INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE, 2009, : 117 - 122
  • [3] Attribute-based Mandarin Speech Recognition using Conditional Random Fields
    Lin, Chi-Yueh
    Wang, Hsiao-Chuan
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 709 - 712
  • [4] DISCRIMINATIVE DURATION MODELING FOR SPEECH RECOGNITION WITH SEGMENTAL CONDITIONAL RANDOM FIELDS
    Kao, Justine T.
    Zweig, Geoffrey
    Nguyen, Patrick
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4476 - 4479
  • [5] Handwritten word recognition using conditional random fields
    Shetty, Shravya
    Srinivasan, Harish
    Srihari, Sargur
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 1098 - 1102
  • [6] Named Entity Recognition using Conditional Random Fields
    Patil, Nita
    Patil, Ajay
    Pawar, B., V
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1181 - 1188
  • [7] Named Entity Recognition Using Conditional Random Fields
    Khan, Wahab
    Daud, Ali
    Shahzad, Khurram
    Amjad, Tehmina
    Banjar, Ameen
    Fasihuddin, Heba
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [8] Urdu part of speech tagging using conditional random fields
    Khan, Wahab
    Daud, Ali
    Nasir, Jamal Abdul
    Amjad, Tehmina
    Arafat, Sachi
    Aljohani, Naif
    Alotaibi, Fahd S.
    LANGUAGE RESOURCES AND EVALUATION, 2019, 53 (03) : 331 - 362
  • [9] Urdu part of speech tagging using conditional random fields
    Wahab Khan
    Ali Daud
    Jamal Abdul Nasir
    Tehmina Amjad
    Sachi Arafat
    Naif Aljohani
    Fahd S. Alotaibi
    Language Resources and Evaluation, 2019, 53 : 331 - 362
  • [10] AUTOMATIC SPEECH RECOGNITION USING HIDDEN CONDITIONAL NEURAL FIELDS
    Fujii, Yasuhisa
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5036 - 5039