Repetition Detection in Stuttered Speech

被引:10
|
作者
Ramteke, Pravin B. [1 ]
Koolagudi, Shashidhar G. [1 ]
Afroz, Fathima [1 ]
机构
[1] Natl Inst Technol Karnataka, Surathkal 575025, Karnataka, India
关键词
MFCCs; Formants; Shimmer; Jitter; Dynamic time warping; CLASSIFICATION;
D O I
10.1007/978-81-322-2538-6_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper mainly focuses on detection of repetitions in stuttered speech. The stuttered speech signal is divided into isolated units based on energy. Mel-frequency cepstrum coefficients (MFCCs), formants and shimmer are used as features for repetition recognition. These features are extracted from each isolated unit. Using Dynamic Time Warping (DTW) the features of each isolated unit are compared with those subsequent units within one second interval of speech. Based on the analysis of scores obtained from DTW a threshold is set, if the score is below the set threshold then the units are identified as repeated events. Twenty seven seconds of speech data used in this work, consists of 50 repetition events. The result shows that the combination of MFCCs, formants and shimmer can be used for the recognition of repetitions in stuttered speech. Out of 50 repetitions, 47 are correctly identified.
引用
收藏
页码:611 / 617
页数:7
相关论文
共 50 条
  • [41] A model of serial order problems in fluent, stuttered and agrammatic speech
    Howell, Peter
    HUMAN MOVEMENT SCIENCE, 2007, 26 (05) : 728 - 741
  • [42] THE CONTRIBUTION OF THE EXCITATORY SOURCE TO THE PERCEPTION OF NEUTRAL VOWELS IN STUTTERED SPEECH
    HOWELL, P
    WILLIAMS, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (01): : 80 - 89
  • [43] Recognition and Classification of Pauses in Stuttered Speech using Acoustic Features
    Afroz, Fathima
    Koolagudi, Shashidhar G.
    2019 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2019, : 921 - 926
  • [44] Segmented Analysis of Eye Gaze Behaviors of Fluent and Stuttered Speech
    Hudock, Daniel
    Stuart, Andrew
    Saltuklaroglu, Tim
    Zhang, Jianliang
    Murray, Nicholas
    Kalinowski, Joseph
    Altieri, Nicholas
    CANADIAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY AND AUDIOLOGY, 2015, 39 (02): : 134 - 145
  • [45] REPETITION IN SCHIZOPHRENIC SPEECH
    MANSCHRECK, TC
    MAHER, BA
    HOOVER, TM
    AMES, D
    LANGUAGE AND SPEECH, 1985, 28 : 255 - 268
  • [46] Voice onset time and formant onset frequencies in Arabic stuttered speech
    Al-Tamimi, Feda
    Howell, Peter
    CLINICAL LINGUISTICS & PHONETICS, 2021, 35 (06) : 493 - 508
  • [47] Empirical Mode Decomposition: A way for finding Pitch (Stuttered speech signal)
    Raju, N.
    Neelamegam, P.
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (06): : 1030 - 1036
  • [48] Autonomic and emotional responses of graduate student clinicians in speech-language pathology to stuttered speech
    Guntupalli, Vijaya K.
    Nanjundeswaran, Chayadevie
    Dayalu, Vikram N.
    Kalinowski, Joseph
    INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2012, 47 (05) : 603 - 608
  • [49] Physiological Correlates of Fluent and Stuttered Speech Production in Preschool Children Who Stutter
    Walsh, Bridget
    Usler, Evan
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2019, 62 (12): : 4309 - 4323
  • [50] Automatic Syllable Repetition Detection in Continuous Speech Based on Linear Prediction Coefficients
    Kobus, Adam
    Kuniszyk-Jozkowiak, Wieslawa
    Codello, Ireneusz
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 295 - 304