A Novel Scheme to Classify Read and Spontaneous Speech

Cited by: 0

Authors:

Kopparapu, Sunil Kumar [1]

Affiliations:

[1] TCS Res, Mumbai, India

Source:

SPEECH AND COMPUTER, SPECOM 2023, PT II | 2023 / Vol. 14339

Keywords:

Spoken speech analysis; Read and spontaneous speech; DeepSpeech features

DOI:

10.1007/978-3-031-48312-7_3

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

The COVID-19 pandemic has led to an increased use of remote telephonic interviews, making it important to distinguish between scripted and spontaneous speech in audio recordings. In this paper, we propose a novel scheme for identifying read and spontaneous speech. Our approach uses a pre-trained DeepSpeech audio-to-alphabet recognition engine to generate a sequence of alphabets from the audio. From these alphabets, we derive features that allow us to discriminate between read and spontaneous speech. Our experimental results show that even a small set of self-explanatory features can classify the two types of speech very effectively.


Pages: 32 - 45

Page count: 14

Related records: 50 in total

[1] Syllable detection in read and spontaneous speech
Pfitzinger, HR
Burger, S
Heid, S
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1261 - 1264
[2] GRASS: The Graz Corpus of Read and Spontaneous Speech
Schuppler, Barbara
Hagmueller, Martin
Morales-Cordovilla, Juan A.
Pessentheiner, Hannes
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1465 - 1470
[3] DETECTION OF TARGET PHONEMES IN SPONTANEOUS AND READ SPEECH
MEHTA, G
CUTLER, A
LANGUAGE AND SPEECH, 1988, 31 : 135 - 156
[4] Prosody for Mandarin Speech Recognition: a Comparative Study of Read and Spontaneous Speech
Yeung, Yu Ting
Qian, Yao
Lee, Tan
Soong, Frank K.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1133 - +
[5] Modeling prosody for language identification on read and spontaneous speech
Rouas, JL
Farinas, J
Pellegrino, F
André-Obrecht, R
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 753 - 756
[6] Modeling prosody for language identification on read and spontaneous speech
Rouas, JL
Farinas, J
Pellegrino, F
André-Obrecht, R
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 40 - 43
[7] THE PROCESSING OF LEXICALLY STRESSED SYLLABLES IN READ AND SPONTANEOUS SPEECH
MCALLISTER, J
LANGUAGE AND SPEECH, 1991, 34 : 1 - 26
[8] DETECTING DEPRESSION: A COMPARISON BETWEEN SPONTANEOUS AND READ SPEECH
Alghowinem, Sharifa
Goecke, Roland
Wagner, Michael
Epps, Julien
Breakspear, Michael
Parker, Gordon
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7547 - 7551
[9] COMPARISON OF PROSODIC PROPERTIES BETWEEN READ AND SPONTANEOUS SPEECH MATERIAL
HOWELL, P
KADIHANIFI, K
SPEECH COMMUNICATION, 1991, 10 (02) : 163 - 169
[10] DIFFERENCES BETWEEN READ AND SPONTANEOUS SPEECH OF DEAF-CHILDREN
SMITH, CR
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 72 (04) : 1304 - 1305
