The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions

被引:0
|
作者
Lopes, Jose [1 ]
Hemmingsson, Nils [1 ]
Astrand, Oliver [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
来源
PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018年
关键词
Dialogues; Spontaneous; Multi-modal;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes the Spot the Difference Corpus which contains 54 interactions between pairs of subjects interacting to find differences in two very similar scenes. The setup used, the participants' metadata and details about collection are described. We are releasing this corpus of task-oriented spontaneous dialogues. This release includes rich transcriptions, annotations, audio and video. We believe that this dataset constitutes a valuable resource to study several dimensions of human communication that go from turn-taking to the study of referring expressions. In our preliminary analyses we have looked at task success (how many differences were found out of the total number of differences) and how it evolves over time. In addition we have looked at scene complexity provided by the RGB components' entropy and how it could relate to speech overlaps, interruptions and the expression of uncertainty. We found there is a tendency that more complex scenes have more competitive interruptions.
引用
收藏
页码:1939 / 1945
页数:7
相关论文
共 50 条
  • [41] Task-Adversarial Adaptation for Multi-modal Recommendation
    Su, Hongzu
    Li, Jingjing
    Li, Fengling
    Zhu, Lei
    Lu, Ke
    Yang, Yang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6530 - 6538
  • [42] Task allocation in robot systems with multi-modal capabilities
    Hojda, Maciej
    IFAC PAPERSONLINE, 2015, 48 (03): : 2109 - 2114
  • [43] Corpus Design for Studying Linguistic Nudges in Human-Computer Spoken Interactions
    Kalashnikova, Natalia
    Pajak, Serge
    Le Guel, Fabrice
    Vasilescu, Ioana
    Serrano, Gemma
    Devillers, Laurence
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4079 - 4087
  • [44] A reception perspective on client-agent interactions in the DiaBiz corpus of spoken Polish
    Deckert, Mikolaj
    Cichosz, Anna
    IBERICA, 2024, (48):
  • [45] A multi-modal HMM for spoken word recognition under noisy environment
    Yoshida, T
    Hamamoto, T
    Hangai, S
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016
  • [46] A morphologically annotated longitudinal corpus of spoken Czech child-adult interactions
    Chroma, Anna
    Slama, Jakub
    Matiasovitsova, Klara
    Treichelova, Jolana
    LANGUAGE RESOURCES AND EVALUATION, 2025, 59 (01) : 413 - 436
  • [47] The implementation of service enabling with spoken language of a multi-modal system ozone
    Zhang, Sen
    Laprie, Yves
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 640 - +
  • [48] Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland
    Lemmenmeier-Batinic, Dolores
    Batinic, Josip
    Escher, Anastasia
    LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (04) : 1607 - 1644
  • [49] Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland
    Dolores Lemmenmeier-Batinić
    Josip Batinić
    Anastasia Escher
    Language Resources and Evaluation, 2023, 57 : 1607 - 1644
  • [50] Labelled data bank of spoken standard German - The Kiel corpus of read/spontaneous speech
    Kohler, KJ
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1938 - 1941