A Novel Scheme to Classify Read and Spontaneous Speech

被引:0
|
作者
Kopparapu, Sunil Kumar [1 ]
机构
[1] TCS Res, Mumbai, India
来源
关键词
Spoken speech analysis; Read and spontaneous speech; DeepSeech features;
D O I
10.1007/978-3-031-48312-7_3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The COVID-19 pandemic has led to an increased use of remote telephonic interviews, making it important to distinguish between scripted and spontaneous speech in audio recordings. In this paper, we propose a novel scheme for identifying read and spontaneous speech. Our approach uses a pre-trained DeepSpeech audio-to-alphabet recognition engine to generate a sequence of alphabets from the audio. From these alphabets, we derive features that allow us to discriminate between read and spontaneous speech. Our experimental results show that even a small set of self-explanatory features can effectively classify the two types of speech very effectively.
引用
收藏
页码:32 / 45
页数:14
相关论文
共 50 条
  • [1] Syllable detection in read and spontaneous speech
    Pfitzinger, HR
    Burger, S
    Heid, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1261 - 1264
  • [2] GRASS: The Graz Corpus of Read and Spontaneous Speech
    Schuppler, Barbara
    Hagmueller, Martin
    Morales-Cordovilla, Juan A.
    Pessentheiner, Hannes
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1465 - 1470
  • [3] DETECTION OF TARGET PHONEMES IN SPONTANEOUS AND READ SPEECH
    MEHTA, G
    CUTLER, A
    LANGUAGE AND SPEECH, 1988, 31 : 135 - 156
  • [4] Prosody for Mandarin Speech Recognition: a Comparative Study of Read and Spontaneous Speech
    Yeung, Yu Ting
    Qian, Yao
    Lee, Tan
    Soong, Frank K.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1133 - +
  • [5] Modeling prosody for language identification on read and spontaneous speech
    Rouas, JL
    Farinas, J
    Pellegrino, F
    André-Obrecht, R
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 753 - 756
  • [6] Modeling prosody for language identification on read and spontaneous speech
    Rouas, JL
    Farinas, J
    Pellegrino, F
    André-Obrecht, R
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 40 - 43
  • [7] THE PROCESSING OF LEXICALLY STRESSED SYLLABLES IN READ AND SPONTANEOUS SPEECH
    MCALLISTER, J
    LANGUAGE AND SPEECH, 1991, 34 : 1 - 26
  • [8] DETECTING DEPRESSION: A COMPARISON BETWEEN SPONTANEOUS AND READ SPEECH
    Alghowinem, Sharifa
    Goecke, Roland
    Wagner, Michael
    Epps, Julien
    Breakspear, Michael
    Parker, Gordon
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7547 - 7551
  • [9] COMPARISON OF PROSODIC PROPERTIES BETWEEN READ AND SPONTANEOUS SPEECH MATERIAL
    HOWELL, P
    KADIHANIFI, K
    SPEECH COMMUNICATION, 1991, 10 (02) : 163 - 169
  • [10] DIFFERENCES BETWEEN READ AND SPONTANEOUS SPEECH OF DEAF-CHILDREN
    SMITH, CR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 72 (04): : 1304 - 1305