DISCRIMINATIVE SEGMENTAL CASCADES FOR FEATURE-RICH PHONE RECOGNITION

Cited by: 0
Authors
Tang, Hao [1]
Wang, Weiran [1]
Gimpel, Kevin [1]
Livescu, Karen [1]
Affiliations
[1] Toyota Technol Inst, Chicago, IL 60637 USA
Keywords
segmental conditional random field; structured prediction cascades; phone recognition; segment neural network; beam search;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Discriminative segmental models, such as segmental conditional random fields (SCRFs) and segmental structured support vector machines (SSVMs), have had success in speech recognition via both lattice rescoring and first-pass decoding. However, such models suffer from slow decoding, hampering the use of computationally expensive features, such as segment neural networks or other high-order features. A typical solution is to use approximate decoding, either by beam pruning in a single pass or by beam pruning to generate a lattice followed by a second pass. In this work, we study discriminative segmental models trained with a hinge loss (i.e., segmental structured SVMs). We show that beam search is not suitable for learning rescoring models in this approach, though it gives good approximate decoding performance when the model is already well-trained. Instead, we consider an approach inspired by structured prediction cascades, which use max-marginal pruning to generate lattices. We obtain a high-accuracy phonetic recognition system with several expensive feature types: a segment neural network, a second-order language model, and second-order phone boundary features.
Pages: 561-568
Number of pages: 8
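
Illustrative sketch
The abstract contrasts beam pruning with max-marginal pruning for lattice generation. The following is a minimal Python sketch of max-marginal pruning on a toy segment lattice, not the authors' implementation. It assumes the threshold used in the structured prediction cascades formulation (a convex combination of the best path score and the mean max-marginal, weighted by a hypothetical parameter `alpha`); the abstract does not state which threshold the paper uses. The edge format, function names, and toy scores are all assumptions made for illustration.

```python
from collections import defaultdict

NEG_INF = float("-inf")


def max_marginals(edges, start, end):
    """Max-marginal of each edge: score of the best full path through it.

    edges: list of (src, dst, score) with src < dst, so node ids are
    already in topological order (an assumption of this sketch).
    """
    fwd = defaultdict(lambda: NEG_INF)  # best score from start to node
    bwd = defaultdict(lambda: NEG_INF)  # best score from node to end
    fwd[start] = 0.0
    bwd[end] = 0.0
    # Forward pass: every edge into a node is visited before edges out of it.
    for u, v, w in sorted(edges, key=lambda e: e[0]):
        fwd[v] = max(fwd[v], fwd[u] + w)
    # Backward pass: every edge out of a node is visited before edges into it.
    for u, v, w in sorted(edges, key=lambda e: e[1], reverse=True):
        bwd[u] = max(bwd[u], w + bwd[v])
    return [fwd[u] + w + bwd[v] for u, v, w in edges]


def prune(edges, start, end, alpha=0.5):
    """Keep edges whose max-marginal reaches the cascade-style threshold."""
    mm = max_marginals(edges, start, end)
    finite = [m for m in mm if m > NEG_INF]  # ignore edges on no full path
    best = max(finite)
    mean = sum(finite) / len(finite)
    tau = alpha * best + (1.0 - alpha) * mean  # assumed threshold form
    return [e for e, m in zip(edges, mm) if m >= tau]


# Toy lattice over time points 0..3; each edge is a scored segment.
edges = [
    (0, 1, 1.0),
    (0, 2, 2.5),
    (1, 2, 1.0),
    (1, 3, 0.5),
    (2, 3, 1.0),
    (0, 3, 0.2),
]
kept = prune(edges, start=0, end=3, alpha=0.5)
print(kept)
```

Because the threshold never exceeds the best path score when `alpha` is at most 1, every edge on the highest-scoring path survives pruning. Beam search, by contrast, discards hypotheses based on partial scores and gives no such guarantee.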