The SRI CLEO Speaker-State Corpus

Authors
Kathol, Andreas [1 ]
Shriberg, Elizabeth [1 ]
de Zambotti, Massimiliano [1 ]
Affiliation
[1] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
Source
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016
Funding
National Science Foundation (USA);
Keywords
Speech corpora; psychophysiology; autonomic nervous system; speech features; emotion;
DOI
10.21437/Interspeech.2016-1141
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
We introduce the SRI CLEO (Conversational Language about Everyday Objects) Speaker-State Corpus of speech, video, and biosignals. The goal of the corpus is to provide insight into the speech and physiological changes resulting from subtle, context-based influences on affect and cognition. Speakers were prompted by collections of pictures of neutral everyday objects and were instructed to provide speech related to any subset of the objects for a preset period of time (120 or 180 seconds depending on task). The corpus provides signals for 43 speakers under four different speaker-state conditions: (1) neutral and emotionally charged audiovisual background; (2) cognitive load; (3) time pressure; and (4) various acted emotions. Unlike previous studies that have linked speaker state to the content of the speaking task itself, the CLEO prompts remain largely pragmatically, semantically, and affectively neutral across all conditions. This framework enables more direct comparisons across both conditions and speakers. The corpus also includes more traditional speaker tasks involving reading and free-form reporting of neutral and emotionally charged content. The explored biosignals include skin conductance, respiration, blood pressure, and ECG. The corpus is in the final stages of processing and will be made available to the research community.
Pages: 1541-1544
Page count: 4