Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks

被引:3
|
作者
Woellmer, Martin [1 ]
Schuller, Bjoern [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-80333 Munich, Germany
关键词
Probabilistic feature extraction; Bottleneck networks; Long Short-Term Memory; Bidirectional speech processing; CONNECTIONIST FEATURE-EXTRACTION; BIDIRECTIONAL LSTM; NECK FEATURES;
D O I
10.1016/j.neucom.2012.06.064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel context-sensitive feature extraction approach for spontaneous speech recognition. As bidirectional Long Short-Term Memory (BLSTM) networks are known to enable improved phoneme recognition accuracies by incorporating long-range contextual information into speech decoding, we integrate the BLSTM principle into a Tandem front-end for probabilistic feature extraction. Unlike the previously proposed approaches which exploit BLSTM modeling by generating a discrete phoneme prediction feature, our feature extractor merges continuous high-level probabilistic BLSTM features with low-level features. By combining BLSTM modeling and Bottleneck (BN) feature generation, we propose a novel front-end that allows us to produce context-sensitive probabilistic feature vectors of arbitrary size, independent of the network training targets. Evaluations on challenging spontaneous, conversational speech recognition tasks show that this concept prevails over recently published architectures for feature-level context modeling. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 50 条
  • [21] SPEECH FEATURE-EXTRACTION USING NEURAL NETWORKS
    NIRANJAN, M
    FALLSIDE, F
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 412 : 197 - 204
  • [22] Context-sensitive feature selection for lazy learners
    Domingos, P
    ARTIFICIAL INTELLIGENCE REVIEW, 1997, 11 (1-5) : 227 - 253
  • [23] CONTROLLABILITY OF CONTEXT-SENSITIVE PROBABILISTIC MIX-VALUED LOGICAL CONTROL NETWORKS WITH CONSTRAINTS
    Liu, Zhenbin
    Wang, Yuzhen
    Li, Haitao
    ASIAN JOURNAL OF CONTROL, 2015, 17 (01) : 246 - 254
  • [24] A Probabilistic Model for Sequence Alignment with Context-Sensitive Indels
    Hickey, Glenn
    Blanchette, Mathieu
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2011, 18 (11) : 1449 - 1464
  • [25] An integer programming approach to optimal control problems in context-sensitive probabilistic Boolean networks
    Kobayashi, Koichi
    Hiraishi, Kunihiko
    AUTOMATICA, 2011, 47 (06) : 1260 - 1264
  • [26] A Probabilistic Model for Sequence Alignment with Context-Sensitive Indels
    Hickey, Glenn
    Blanchette, Mathieu
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, 2011, 6577 : 85 - 103
  • [27] Static output feedback set stabilization for context-sensitive probabilistic Boolean control networks
    Tong, Liyun
    Liu, Yang
    Lou, Jungang
    Lu, Jianquan
    Alsaadi, Fuad E.
    APPLIED MATHEMATICS AND COMPUTATION, 2018, 332 : 263 - 275
  • [28] Speech enhancement method using context-sensitive attention mechanism and recurrent neural network
    Lan, Tian
    Hui, Guoqiang
    Li, Meng
    Lü, Yilan
    Liu, Qiao
    Shengxue Xuebao/Acta Acustica, 2020, 45 (06): : 897 - 905
  • [29] Embedding Time Differences in Context-sensitive Neural Networks for Learning Time to Event
    Dehghani, Nazanin
    Hajipoor, Hassan
    Amiri, Hadi
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 630 - 636
  • [30] Context-Sensitive Temporal Feature Learning for Gait Recognition
    Huang, Xiaohu
    Zhu, Duowang
    Wang, Hao
    Wang, Xinggang
    Yang, Bo
    He, Botao
    Liu, Wenyu
    Feng, Bin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12889 - 12898