A multimodal analysis of vocal and visual backchannels in spontaneous dialogs

Cited by: 0

Authors:

Truong, Khiet P. [1]

Poppe, Ronald [1]

de Kok, Iwan [1]

Heylen, Dirk [1]

Affiliation:

[1] Univ Twente, Enschede, Netherlands

Source:

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011

Keywords:

listener response; backchannel; continuer; prediction; head nod; vocalization; gaze; pitch; FEATURES;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Backchannels (BCs) are short vocal and visual listener responses that signal attention, interest, and understanding to the speaker. Previous studies have investigated BC prediction in telephone-style dialogs from prosodic cues. In contrast, we consider spontaneous face-to-face dialogs. The additional visual modality allows speaker and listener to monitor each other's attention continuously, and we hypothesize that this affects the BC-inviting cues. In this study, we investigate how gaze, in addition to prosody, can cue BCs. Moreover, we focus on the type of BC performed, with the aim to find out whether vocal and visual BCs are invited by similar cues. In contrast to telephone-style dialogs, we do not find rising/falling pitch to be a BC-inviting cue. However, in a face-to-face setting, gaze appears to cue BCs. In addition, we find that mutual gaze occurs significantly more often during visual BCs. Moreover, vocal BCs are more likely to be timed during pauses in the speaker's speech.

Pages: 2984-2987

Page count: 4

