The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions

Authors
Lopes, Jose [1]
Hemmingsson, Nils [1]
Astrand, Oliver [1]
Affiliations
[1] KTH Royal Inst Technol, Stockholm, Sweden
Source
PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018
Keywords
Dialogues; Spontaneous; Multi-modal;
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This paper describes the Spot the Difference Corpus, which contains 54 interactions between pairs of subjects collaborating to find the differences between two very similar scenes. The setup used, the participants' metadata and details about the collection are described. We are releasing this corpus of task-oriented spontaneous dialogues. The release includes rich transcriptions, annotations, audio and video. We believe that this dataset constitutes a valuable resource for studying several dimensions of human communication, ranging from turn-taking to referring expressions. In our preliminary analyses we have looked at task success (the number of differences found out of the total number of differences) and how it evolves over time. In addition, we have looked at scene complexity, measured by the entropy of the RGB components, and how it could relate to speech overlaps, interruptions and the expression of uncertainty. We found a tendency for more complex scenes to have more competitive interruptions.
Pages: 1939-1945
Number of pages: 7