The SRI CLEO Speaker-State Corpus

被引:0
|
作者
Kathol, Andreas [1 ]
Shriberg, Elizabeth [1 ]
de Zambotti, Massimilano [1 ]
机构
[1] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
基金
美国国家科学基金会;
关键词
Speech corpora; psychophysiology; autonomic nervous system; speech features; emotion;
D O I
10.21437/Interspeech.2016-1141
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We introduce the SRI CLEO (Conversational Language about Everyday Objects) Speaker-State Corpus of speech, video, and biosignals. The goal of the corpus is providing insight on the speech and physiological changes resulting from subtle, context-based influences on affect and cognition. Speakers were prompted by collections of pictures of neutral everyday objects and were instructed to provide speech related to any subset of the objects for a preset period of time (120 or 180 seconds depending on task). The corpus provides signals for 43 speakers under four different speaker-state conditions: (1) neutral and emotionally charged audiovisual background; (2) cognitive load; (3) time pressure; and (4) various acted emotions. Unlike previous studies that have linked speaker state to the content of the speaking task itself, the CLEO prompts remain largely pragmatically, semantically, and affectively neutral across all conditions. This framework enables for more direct comparisons across both conditions and speakers. The corpus also includes more traditional speaker tasks involving reading and free-form reporting of neutral and emotionally charged content. The explored biosignals include skin conductance, respiration, blood pressure, and ECG. The corpus is in the final stages of processing and will be made available to the research community.
引用
收藏
页码:1541 / 1544
页数:4
相关论文
共 50 条
  • [41] Speaker recognition based on multilevel speech signal analysis on Polish corpus
    Drgas, Szymon
    Dabrowski, Adam
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (12) : 4195 - 4211
  • [42] A CORPUS-BASED STUDY OF BE-COPULA IN NATIVE SPEAKER AND NON-NATIVE SPEAKER LEARNERS' ARGUMENTATIVE ESSAYS
    Aziz, Roslina Abdul
    Don, Zuraidah Mohd
    JOURNAL OF NUSANTARA STUDIES-JONUS, 2022, 7 (02): : 21 - 43
  • [43] State-of-the-art in speaker recognition
    Faundez-Zanuy, M
    Monte-Moreno, E
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2005, 20 (05) : 7 - 12
  • [44] Evaluation of EMD-based speaker recognition using ISCSLP2006 Chinese speaker recognition evaluation corpus
    Kuroiwa, Shingo
    Tsuge, Satoru
    Kita, Masahiko
    Ren, Fuji
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 539 - +
  • [45] Evaluation of the epistemic state of the speaker/author
    Clifton, Charles, Jr.
    Frazier, Lyn
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2018, 71 (06): : 1482 - 1492
  • [46] VIEW FROM ... CLEO 2011 Ultraviolet goes solid-state
    Pile, David
    NATURE PHOTONICS, 2011, 5 (07) : 394 - 395
  • [47] The INTERSPEECH 2011 Speaker State Challenge
    Schuller, Bjoern
    Steidl, Stefan
    Batliner, Anton
    Schiel, Florian
    Krajewski, Jarek
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3208 - 3211
  • [48] SRI-LANKA - STATE TERRORISM
    WILSON, A
    ECONOMIC AND POLITICAL WEEKLY, 1981, 16 (27) : 1144 - 1144
  • [49] The current state of Sri Lanka Portuguese
    Nordhoff, Sebastian
    JOURNAL OF PIDGIN AND CREOLE LANGUAGES, 2013, 28 (02) : 425 - 434
  • [50] CLeLfPC: a Large Open Multi-Speaker Corpus of French Cued Speech
    Bigi, Brigitte
    Zimmermann, Maryvonne
    Andre, Carine
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 987 - 994