The SRI CLEO Speaker-State Corpus

被引：0

作者：

Kathol, Andreas ^{[1
]}

Shriberg, Elizabeth ^{[1
]}

de Zambotti, Massimilano ^{[1
]}

机构：

[1] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

基金：

美国国家科学基金会;

关键词：

Speech corpora; psychophysiology; autonomic nervous system; speech features; emotion;

D O I：

10.21437/Interspeech.2016-1141

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We introduce the SRI CLEO (Conversational Language about Everyday Objects) Speaker-State Corpus of speech, video, and biosignals. The goal of the corpus is providing insight on the speech and physiological changes resulting from subtle, context-based influences on affect and cognition. Speakers were prompted by collections of pictures of neutral everyday objects and were instructed to provide speech related to any subset of the objects for a preset period of time (120 or 180 seconds depending on task). The corpus provides signals for 43 speakers under four different speaker-state conditions: (1) neutral and emotionally charged audiovisual background; (2) cognitive load; (3) time pressure; and (4) various acted emotions. Unlike previous studies that have linked speaker state to the content of the speaking task itself, the CLEO prompts remain largely pragmatically, semantically, and affectively neutral across all conditions. This framework enables for more direct comparisons across both conditions and speakers. The corpus also includes more traditional speaker tasks involving reading and free-form reporting of neutral and emotionally charged content. The explored biosignals include skin conductance, respiration, blood pressure, and ECG. The corpus is in the final stages of processing and will be made available to the research community.

引用

页码：1541 / 1544

页数：4

共 50 条

[21] Association of lexical and collocation knowledge: A comparative analysis of a learner corpus of English and a native speaker corpus
Kim, Sung-Yeon
Shin, Dongkwang
Kim, Kyung-Sook
VIAL-VIGO INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2024, 21 : 67 - 96
[22] Association of lexical and collocation knowledge: A comparative analysis of a learner corpus of English and a native speaker corpus
Kim, Sung-Yeon
Shin, Dongkwang
Kim, Kyung-Sook
VIAL-VIGO INTERNATIONAL JOURNAL OF APPLIED LINGUISTICS, 2024, 21 : 67 - 96
[23] Differences in English Vocabulary Use: Insights from Spoken Learner Corpus and Native Speaker Corpus
Genc, Bilal
EGITIM VE BILIM-EDUCATION AND SCIENCE, 2013, 38 (167): : 41 - 49
[24] Minister of State for Care confirmed as speaker this MarchMinister of State for Care confirmed as speaker this March
British Dental Journal, 2025, 238 (4) : 281 - 281
[25] Speaker verification with TIMIT corpus - some remarks on classical methods
Dustor, Adam
2020 SIGNAL PROCESSING - ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2020, : 174 - 179
[26] VoCMex: A voice corpus in Mexican Spanish for research in speaker recognition
Olguín-Espinoza J.-M.
Mayorga-Ortiz P.
Hidalgo-Silva H.
Vizcarra-Corral L.
Mendiola-Cárdenas M.-L.
Olguín-Espinoza, J.-M. (molguin@uabc.edu.mx), 1600, Kluwer Academic Publishers (16): : 295 - 302
[27] AHUMADA: A large speech corpus in Spanish for speaker identification and verification
Ortega-Garcia, J
Gonzalez-Rodriguez, J
Marrero-Aguiar, V
Diaz-Gomez, JJ
Garcia-Jimenez, R
Lucena-Molina, J
Sanchez-Molero, JAG
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 773 - 776
[28] AHUMADA: A large speech corpus in Spanish for speaker characterization and identification
Ortega-Garcia, J
Gonzalez-Rodriguez, J
Marrero-Aguiar, V
SPEECH COMMUNICATION, 2000, 31 (2-3) : 255 - 264
[29] SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers
Arezzo, Alessandro
Berretti, Stefano
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
[30] Multilingual Speaker Age Recognition: Regression Analyses on the Lwazi Corpus
Feld, Michael
Barnard, Etienne
van Heerden, Charl
Mueller, Christian
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 534 - 539

← 1 2 3 4 5 →