Speech Recognition Using Augmented Conditional Random Fields

Cited by: 45
Authors
Hifny, Yasser [1]
Renals, Steve [2]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland
Keywords
Augmented conditional random fields (ACRFs); augmented spaces; discriminative compression; hidden Markov models (HMMs); phone recognition; features
DOI
10.1109/TASL.2008.2010286
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although HMMs are a natural choice to warp the time axis and model the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects which have made them successful. In particular, the acoustic modeling problem is reformulated in a data-driven, sparse, augmented space to increase discrimination. Acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. In the TIMIT phone recognition task, a phone error rate of 23.0% was recorded on the full test set, a significant improvement over comparable HMM-based systems.
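The abstract reports its result as a phone error rate. As a rough illustration of how that metric is conventionally defined (substitutions, deletions, and insertions divided by the number of reference phones), here is a minimal sketch. The function names `edit_distance` and `phone_error_rate` and the toy phone sequences are hypothetical; this is not the authors' scoring setup, and it ignores any phone-set folding a real TIMIT evaluation might apply.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two label sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion of a reference phone
                            curr[j - 1] + 1,      # insertion of a hypothesis phone
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phone_error_rate(refs, hyps):
    """Percentage of errors over all reference phones (assumes a non-empty reference)."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total = sum(len(r) for r in refs)
    return 100.0 * errors / total


# Hypothetical toy utterance: one substitution (ae -> eh) and one deletion (final sil).
refs = [["sil", "k", "ae", "t", "sil"]]
hyps = [["sil", "k", "eh", "t"]]
print(phone_error_rate(refs, hyps))  # 2 errors over 5 reference phones -> 40.0
```

Because insertions count as errors, a rate computed this way can exceed 100% when the hypothesis is much longer than the reference.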
Pages: 354-365
Page count: 12