The SRI CLEO Speaker-State Corpus

Cited by: 0
Authors
Kathol, Andreas [1 ]
Shriberg, Elizabeth [1 ]
de Zambotti, Massimiliano [1 ]
Affiliations
[1] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
Funding
US National Science Foundation;
Keywords
Speech corpora; psychophysiology; autonomic nervous system; speech features; emotion;
DOI
10.21437/Interspeech.2016-1141
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
We introduce the SRI CLEO (Conversational Language about Everyday Objects) Speaker-State Corpus of speech, video, and biosignals. The goal of the corpus is to provide insight into the speech and physiological changes resulting from subtle, context-based influences on affect and cognition. Speakers were prompted by collections of pictures of neutral everyday objects and were instructed to provide speech related to any subset of the objects for a preset period of time (120 or 180 seconds depending on task). The corpus provides signals for 43 speakers under four different speaker-state conditions: (1) neutral and emotionally charged audiovisual background; (2) cognitive load; (3) time pressure; and (4) various acted emotions. Unlike previous studies that have linked speaker state to the content of the speaking task itself, the CLEO prompts remain largely pragmatically, semantically, and affectively neutral across all conditions. This framework enables more direct comparisons across both conditions and speakers. The corpus also includes more traditional speaker tasks involving reading and free-form reporting of neutral and emotionally charged content. The explored biosignals include skin conductance, respiration, blood pressure, and ECG. The corpus is in the final stages of processing and will be made available to the research community.
Pages: 1541-1544
Page count: 4