Repetition Detection in Stuttered Speech

被引:10
|
作者
Ramteke, Pravin B. [1 ]
Koolagudi, Shashidhar G. [1 ]
Afroz, Fathima [1 ]
机构
[1] Natl Inst Technol Karnataka, Surathkal 575025, Karnataka, India
关键词
MFCCs; Formants; Shimmer; Jitter; Dynamic time warping; CLASSIFICATION;
D O I
10.1007/978-81-322-2538-6_63
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper mainly focuses on detection of repetitions in stuttered speech. The stuttered speech signal is divided into isolated units based on energy. Mel-frequency cepstrum coefficients (MFCCs), formants and shimmer are used as features for repetition recognition. These features are extracted from each isolated unit. Using Dynamic Time Warping (DTW) the features of each isolated unit are compared with those subsequent units within one second interval of speech. Based on the analysis of scores obtained from DTW a threshold is set, if the score is below the set threshold then the units are identified as repeated events. Twenty seven seconds of speech data used in this work, consists of 50 repetition events. The result shows that the combination of MFCCs, formants and shimmer can be used for the recognition of repetitions in stuttered speech. Out of 50 repetitions, 47 are correctly identified.
引用
收藏
页码:611 / 617
页数:7
相关论文
共 50 条
  • [31] Segment-Removal Based Stuttered Speech Remediation
    Arbajian, Pierre
    Hajja, Ayman
    Ras, Zbigniew W.
    Wieczorkowska, Alicja A.
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, NFMCP 2017, 2018, 10785 : 16 - 34
  • [32] INTERRELATIONSHIPS AMONG FLUENCY PRODUCING VARIABLES IN STUTTERED SPEECH
    WEBSTER, RL
    LUBKER, BB
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1968, 11 (04): : 754 - &
  • [33] The University College London Archive of Stuttered Speech (UCLASS)
    Howell, Peter
    Davis, Stephen
    Bartrip, Jon
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2009, 52 (02): : 556 - 569
  • [34] Computer-Assisted Disfluency Counts for Stuttered Speech
    Heeman, Peter A.
    McMillin, Andy
    Yaruss, J. Scott
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1324 - +
  • [35] MOLECULAR SELF-ANALYSES OF STUTTERED SPEECH VIA SPEECH TIME EXPANSION
    KROLL, RM
    OKEEFE, BM
    JOURNAL OF FLUENCY DISORDERS, 1985, 10 (02) : 93 - 105
  • [36] Automatic recognition of repetitions in stuttered speech: Using end-point detection and dynamic time warping
    Yeh, P. H.
    Yang, S. L.
    Yang, C. C.
    Shieh, M. D.
    10TH OXFORD DYSFLUENCY CONFERENCE, ODC 2014, 2015, 193 : 356 - 356
  • [38] Adaptive Optimization Based Neural Network for Classification of Stuttered Speech
    Manjula, G.
    Shivakumar, M.
    Geetha, Y. V.
    PROCEEDINGS OF 2019 THE 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, SECURITY AND PRIVACY (ICCSP 2019) WITH WORKSHOP 2019 THE 4TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP 2019), 2019, : 93 - 98
  • [39] ACOUSTIC ANALYSIS AND PERCEPTION OF VOWELS IN CHILDRENS AND TEENAGERS STUTTERED SPEECH
    HOWELL, P
    WILLIAMS, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 91 (03): : 1697 - 1706
  • [40] LISTENER PREFERENCES FOR STUTTERED AND SYLLABLE-TIMED SPEECH PRODUCTION
    MALLARD, AR
    MEYER, LA
    JOURNAL OF FLUENCY DISORDERS, 1979, 4 (02) : 117 - 121