The Spot the Difference corpus: a multi-modal corpus of spontaneous task oriented spoken interactions

被引：0

作者：

Lopes, Jose ^{[1
]}

Hemmingsson, Nils ^{[1
]}

Astrand, Oliver ^{[1
]}

机构：

[1] KTH Royal Inst Technol, Stockholm, Sweden

来源：

PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018) | 2018年

关键词：

Dialogues; Spontaneous; Multi-modal;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This paper describes the Spot the Difference Corpus which contains 54 interactions between pairs of subjects interacting to find differences in two very similar scenes. The setup used, the participants' metadata and details about collection are described. We are releasing this corpus of task-oriented spontaneous dialogues. This release includes rich transcriptions, annotations, audio and video. We believe that this dataset constitutes a valuable resource to study several dimensions of human communication that go from turn-taking to the study of referring expressions. In our preliminary analyses we have looked at task success (how many differences were found out of the total number of differences) and how it evolves over time. In addition we have looked at scene complexity provided by the RGB components' entropy and how it could relate to speech overlaps, interruptions and the expression of uncertainty. We found there is a tendency that more complex scenes have more competitive interruptions.

引用

页码：1939 / 1945

页数：7

共 50 条

[41] Task-Adversarial Adaptation for Multi-modal Recommendation
Su, Hongzu
Li, Jingjing
Li, Fengling
Zhu, Lei
Lu, Ke
Yang, Yang
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 6530 - 6538
[42] Task allocation in robot systems with multi-modal capabilities
Hojda, Maciej
IFAC PAPERSONLINE, 2015, 48 (03): : 2109 - 2114
[43] Corpus Design for Studying Linguistic Nudges in Human-Computer Spoken Interactions
Kalashnikova, Natalia
Pajak, Serge
Le Guel, Fabrice
Vasilescu, Ioana
Serrano, Gemma
Devillers, Laurence
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 4079 - 4087
[44] A reception perspective on client-agent interactions in the DiaBiz corpus of spoken Polish
Deckert, Mikolaj
Cichosz, Anna
IBERICA, 2024, (48):
[45] A multi-modal HMM for spoken word recognition under noisy environment
Yoshida, T
Hamamoto, T
Hangai, S
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016
[46] A morphologically annotated longitudinal corpus of spoken Czech child-adult interactions
Chroma, Anna
Slama, Jakub
Matiasovitsova, Klara
Treichelova, Jolana
LANGUAGE RESOURCES AND EVALUATION, 2025, 59 (01) : 413 - 436
[47] The implementation of service enabling with spoken language of a multi-modal system ozone
Zhang, Sen
Laprie, Yves
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 640 - +
[48] Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland
Lemmenmeier-Batinic, Dolores
Batinic, Josip
Escher, Anastasia
LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (04) : 1607 - 1644
[49] Map Task Corpus of Heritage BCMS spoken by second-generation speakers in Switzerland
Dolores Lemmenmeier-Batinić
Josip Batinić
Anastasia Escher
Language Resources and Evaluation, 2023, 57 : 1607 - 1644
[50] Labelled data bank of spoken standard German - The Kiel corpus of read/spontaneous speech
Kohler, KJ
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1938 - 1941

← 1 2 3 4 5 →