Speech Recognition Using Augmented Conditional Random Fields

被引:45
|
作者
Hifny, Yasser [1 ]
Renals, Steve [2 ]
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland
关键词
Augmented conditional random fields (ACRFs); augmented spaces; discriminative compression; hidden Markov models (HMMs); PHONE RECOGNITION; FEATURES;
D O I
10.1109/TASL.2008.2010286
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although RMMs are a natural choice to warp the time axis and model the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects which have made them successful. In particular, the acoustic modeling problem is reformulated in a data driven, sparse, augmented space to increase discrimination. Acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. In the TIMIT phone recognition task, a phone error rate of 23.0% was recorded on the full test set, a significant improvement over comparable HMM-based systems.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 50 条
  • [41] Human action recognition using manifold learning and hidden conditional random fields
    Liu, Fa-Wang
    Jia, Yun-De
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (SUPPL.): : 69 - 77
  • [42] Portuguese Named Entity Recognition using Conditional Random Fields and Local Grammars
    Pirovani, Juliana P. C.
    de Oliveira, Elias
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4452 - 4456
  • [43] Human Activity Recognition Using Gaussian Mixture Hidden Conditional Random Fields
    Siddiqi, Muhammad Hameed
    Alruwaili, Madallah
    Ali, Amjad
    Alanazi, Saad
    Zeshan, Furkh
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2019, 2019
  • [44] Section heading recognition in electronic health records using conditional random fields
    Chen, Chih-Wei
    Chang, Nai-Wen
    Chang, Yung-Chun
    Dai, Hong-Jie
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8916 : 47 - 55
  • [45] Conditional Random Fields for Spanish Named Entity Recognition Using Unsupervised Features
    Copara, Jenny
    Ochoa, Jose
    Thorne, Camilo
    Glavas, Goran
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2016, 2016, 10022 : 175 - 186
  • [46] Human Action Recognition Using Manifold Learning and Hidden Conditional Random Fields
    Liu, Fawang
    Jia, Yunde
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008, : 693 - 698
  • [47] Disease Named Entity Recognition Using Semisupervised Learning and Conditional Random Fields
    Suakkaphong, Nichalin
    Zhang, Zhu
    Chen, Hsinchun
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2011, 62 (04): : 727 - 737
  • [48] Automatic Social Role Recognition In Professional Meetings Using Conditional Random Fields
    Sapru, Ashtosh
    Bourlard, Herve
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1529 - 1533
  • [49] Pedestrian Intention Recognition using Latent-dynamic Conditional Random Fields
    Schulz, Andreas Th.
    Stiefelhagen, Rainer
    2015 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2015, : 622 - 627
  • [50] Named Entity Recognition in Bengali and Hindi Using MuRIL and Conditional Random Fields
    Kaushik Bose
    Kamal Sarkar
    SN Computer Science, 5 (7)