Whole-Word Recognition from Articulatory Movements for Silent Speech Interfaces

被引:0
|
作者
Wang, Jun [1 ]
Samal, Ashok
Green, Jordan R. [1 ]
Rudzicz, Frank
机构
[1] Univ Nebraska, Dept Special Educ & Commun Disorders, Lincoln, NE 68588 USA
关键词
silent speech recognition; speech impairment; laryngectomy; support vector machine; KNOWLEDGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Articulation-based silent speech interfaces convert silently produced speech movements into audible words. These systems are still in their experimental stages, but have significant potential for facilitating oral communication in persons with laryngectomy or speech impairments. In this paper, we report the result of a novel, real-time algorithm that recognizes whole-words based on articulatory movements. This approach differs from prior work that has focused primarily on phoneme-level recognition based on articulatory features. On average, our algorithm missed 1.93 words in a sequence of twenty-five words with an average latency of 0.79 seconds for each word prediction using a data set of 5,500 isolated word samples collected from ten speakers. The results demonstrate the effectiveness of our approach and its potential for building a real-time articulation-based silent speech interface for health applications.
引用
收藏
页码:1326 / 1329
页数:4
相关论文
共 50 条
  • [1] SENTENCE RECOGNITION FROM ARTICULATORY MOVEMENTS FOR SILENT SPEECH INTERFACES
    Wang, Jun
    Samal, Ashok
    Green, Jordan R.
    Rudzicz, Frank
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4985 - 4988
  • [2] WHOLE-WORD SEGMENTAL SPEECH RECOGNITION WITH ACOUSTIC WORD EMBEDDINGS
    Shi, Bowen
    Settle, Shane
    Livescu, Karen
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 164 - 171
  • [4] Deficits in audiovisual speech perception in normal aging emerge at the level of whole-word recognition
    Stevenson, Ryan A.
    Nelms, Caitlin E.
    Baum, Sarah H.
    Zurkovsky, Lilia
    Barense, Morgan D.
    Newhouse, Paul A.
    Wallace, Mark T.
    NEUROBIOLOGY OF AGING, 2015, 36 (01) : 283 - 291
  • [5] Does "whole-word shape" play a role in visual word recognition?
    Perea, M
    Rosa, E
    PERCEPTION & PSYCHOPHYSICS, 2002, 64 (05): : 785 - 794
  • [6] Does “whole-word shape” play a role in visual word recognition?
    Manuel Perea
    Eva Rosa
    Perception & Psychophysics, 2002, 64 : 785 - 794
  • [7] Robust speech recognition against misdetection using whole-word HMMs and relaxed algorithm for likelihood calculation
    Hayasaka, Noboru
    IEEJ Transactions on Electronics, Information and Systems, 2015, 135 (10) : 1236 - 1243
  • [8] Multisensory Speech Perception in Autism Spectrum Disorder: From Phoneme to Whole-Word Perception
    Stevenson, Ryan A.
    Baum, Sarah H.
    Segers, Magali
    Ferber, Susanne
    Barense, Morgan D.
    Wallace, Mark T.
    AUTISM RESEARCH, 2017, 10 (07) : 1280 - 1290
  • [9] Whole-word measures and the speech production of typically developing Dutch children
    Beers, Mieke
    Rodenburg-Van Wee, Marianne
    Gerrits, Ellen
    CLINICAL LINGUISTICS & PHONETICS, 2019, 33 (12) : 1149 - 1164
  • [10] Speaker-Independent Silent Speech Recognition From Flesh-Point Articulatory Movements Using an LSTM Neural Network
    Kim, Myungjong
    Cao, Beiming
    Mau, Ted
    Wang, Jun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2323 - 2336