Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks

被引:3
|
作者
Woellmer, Martin [1 ]
Schuller, Bjoern [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-80333 Munich, Germany
关键词
Probabilistic feature extraction; Bottleneck networks; Long Short-Term Memory; Bidirectional speech processing; CONNECTIONIST FEATURE-EXTRACTION; BIDIRECTIONAL LSTM; NECK FEATURES;
D O I
10.1016/j.neucom.2012.06.064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel context-sensitive feature extraction approach for spontaneous speech recognition. As bidirectional Long Short-Term Memory (BLSTM) networks are known to enable improved phoneme recognition accuracies by incorporating long-range contextual information into speech decoding, we integrate the BLSTM principle into a Tandem front-end for probabilistic feature extraction. Unlike the previously proposed approaches which exploit BLSTM modeling by generating a discrete phoneme prediction feature, our feature extractor merges continuous high-level probabilistic BLSTM features with low-level features. By combining BLSTM modeling and Bottleneck (BN) feature generation, we propose a novel front-end that allows us to produce context-sensitive probabilistic feature vectors of arbitrary size, independent of the network training targets. Evaluations on challenging spontaneous, conversational speech recognition tasks show that this concept prevails over recently published architectures for feature-level context modeling. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 50 条
  • [31] Optimal Finite-horizon Control Problem of Context-sensitive Probabilistic Boolean Networks with Perturbation
    Liu Zhenbin
    Wang Yuzhen
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 140 - 145
  • [32] Optimal Control for Context-Sensitive Probabilistic Boolean Networks with Perturbation using Probabilisitic Model Checking
    Wei, Ou
    Guo, Zonghao
    Niu, Yun
    Liao, Wenyuan
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 210 - 216
  • [33] Answering queries from context-sensitive probabilistic knowledge bases
    Ngo, L
    Haddawy, P
    THEORETICAL COMPUTER SCIENCE, 1997, 171 (1-2) : 147 - 177
  • [34] An Enhanced Context-sensitive Proximity Model for Probabilistic Information Retrieval
    Zhao, Jiashu
    Huang, Jimmy Xiangji
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1131 - 1134
  • [35] Probabilistic method for context-sensitive detection of polyps in CT colonography
    Naeppi, Janne J.
    Regge, Daniele
    Yoshida, Hiroyuki
    MEDICAL IMAGING 2011: COMPUTER-AIDED DIAGNOSIS, 2011, 7963
  • [36] Drop Fingerprint Recognition Based on Feature Extraction and Probabilistic Neural Networks
    Song, Qing
    Li, Jie
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 398 - 403
  • [37] Context-Sensitive Prediction of Facial Expressivity using Multimodal Hierarchical Bayesian Neural Networks
    Joshi, Ajjen
    Ghosh, Soumya
    Gunnery, Sarah
    Tickle-Degnen, Linda
    Sclaroff, Stan
    Betke, Margrit
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 278 - 285
  • [38] Incremental training of first order recurrent neural networks to predict a context-sensitive language
    Chalup, SK
    Blair, AD
    NEURAL NETWORKS, 2003, 16 (07) : 955 - 972
  • [39] Neural Dynamics of Context-sensitive Adjustments in Cognitive Flexibility
    Siqi-Liu, Audrey
    Egner, Tobias
    Woldorff, Marty G.
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2022, 34 (03) : 480 - 494
  • [40] Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. RECOGNITION AND INTERACTION TECHNOLOGIES, HCI 2019, PT II, 2019, 11567 : 117 - 132