Speech Recognition Using Augmented Conditional Random Fields

被引:45
|
作者
Hifny, Yasser [1 ]
Renals, Steve [2 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland
关键词
Augmented conditional random fields (ACRFs); augmented spaces; discriminative compression; hidden Markov models (HMMs); PHONE RECOGNITION; FEATURES;
D O I
10.1109/TASL.2008.2010286
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although RMMs are a natural choice to warp the time axis and model the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects which have made them successful. In particular, the acoustic modeling problem is reformulated in a data driven, sparse, augmented space to increase discrimination. Acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. In the TIMIT phone recognition task, a phone error rate of 23.0% was recorded on the full test set, a significant improvement over comparable HMM-based systems.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 50 条
  • [21] Gaussian Conditional Random Fields for Face Recognition
    Smereka, Jonathon M.
    Kumar, B. V. K. Vijaya
    Rodriguez, Andres
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 155 - 162
  • [22] Hidden Conditional Random Fields for Face Recognition
    Yang, Huachun
    2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND APPLICATIONS (CSA), 2013, : 337 - 340
  • [23] Hidden Conditional Random Fields for Gait Recognition
    Hagui, Mabrouka
    Mahjoub, Mohamed Ali
    2016 SECOND INTERNATIONAL IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2016,
  • [24] Contextual Object Recognition with Conditional Random Fields
    Can, Gulcan
    Firat, Orhan
    Vural, Fatos T. Yarman
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [25] Hidden Conditional Random Fields for Face Recognition
    Yang, Huachun
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2012), 2013, 8768
  • [26] CRANDEM: Conditional Random Fields for Word Recognition
    Morris, Jeremy
    Fosler-Lussier, Eric
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 3035 - 3038
  • [27] Protein fold recognition using segmentation conditional random fields (SCRFs)
    Liu, Y
    Carbonell, J
    Weigele, P
    Gopalakrishnan, V
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) : 394 - 406
  • [28] Biomedical named entities recognition using conditional random fields model
    Sun, Chengjie
    Guan, Yi
    Wang, Xiaolong
    Lin, Lei
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 1279 - 1288
  • [29] Kannada Named Entity Recognition and classification using Conditional Random Fields
    Amarappa, S.
    Sathyanarayana, S. V.
    2015 INTERNATIONAL CONFERENCE ON EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY (ICERECT), 2015, : 186 - 191
  • [30] LANGUAGE RECOGNITION USING DEEP-STRUCTURED CONDITIONAL RANDOM FIELDS
    Yu, Dong
    Wang, Shizhen
    Karam, Zahi
    Deng, Li
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5030 - 5033