A multimodal analysis of vocal and visual backchannels in spontaneous dialogs

被引:0
|
作者
Truong, Khiet P. [1 ]
Poppe, Ronald [1 ]
de Kok, Iwan [1 ]
Heylen, Dirk [1 ]
机构
[1] Univ Twente, Enschede, Netherlands
关键词
listener response; backchannel; continuer; prediction; head nod; vocalization; gaze; pitch; FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Backchannels (BCs) are short vocal and visual listener responses that signal attention, interest, and understanding to the speaker. Previous studies have investigated BC prediction in telephone-style dialogs from prosodic cues. In contrast, we consider spontaneous face-to-face dialogs. The additional visual modality allows speaker and listener to monitor each other's attention continuously, and we hypothesize that this affects the BC-inviting cues. In this study, we investigate how gaze, in addition to prosody, can cue BCs. Moreover, we focus on the type of BC performed, with the aim to find out whether vocal and visual BCs are invited by similar cues. In contrast to telephone-style dialogs, we do not find rising/falling pitch to be a BC-inviting cue. However, in a face-to-face setting, gaze appears to cue BCs. In addition, we find that mutual gaze occurs significantly more often during visual BCs. Moreover, vocal BCs are more likely to be timed during pauses in the speaker's speech.
引用
收藏
页码:2984 / 2987
页数:4
相关论文
共 50 条
  • [1] Multimodal Backchannels for Embodied Conversational Agents
    Bevacqua, Elisabetta
    Pammi, Sathish
    Hyniewska, Sylwia Julia
    Schroeder, Marc
    Pelachaud, Catherine
    INTELLIGENT VIRTUAL AGENTS, IVA 2010, 2010, 6356 : 194 - 200
  • [2] Predicting Listener Backchannels: A Probabilistic Multimodal Approach
    Morency, Louis-Philippe
    de Kok, Iwan
    Gratch, Jonathan
    INTELLIGENT VIRTUAL AGENTS, PROCEEDINGS, 2008, 5208 : 176 - +
  • [3] An analysis of turn-taking and backchannels based on prosodic and syntactic features in Japanese map task dialogs
    Koiso, H
    Horiuchi, Y
    Tutiya, S
    Ichikawa, A
    Den, Y
    LANGUAGE AND SPEECH, 1998, 41 : 295 - 321
  • [4] A probabilistic multimodal approach for predicting listener backchannels
    Morency, Louis-Philippe
    de Kok, Iwan
    Gratch, Jonathan
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2010, 20 (01) : 70 - 84
  • [5] A probabilistic multimodal approach for predicting listener backchannels
    Louis-Philippe Morency
    Iwan de Kok
    Jonathan Gratch
    Autonomous Agents and Multi-Agent Systems, 2010, 20 : 70 - 84
  • [6] Visual and multimodal analysis of human spontaneous behaviour: Introduction to the Special Issue
    Pantic, Maja
    Cohn, Jeffrey F.
    IMAGE AND VISION COMPUTING, 2009, 27 (12) : 1741 - 1742
  • [7] VOCAL DIALOGS IN THE NEONATAL-PERIOD
    ROSENTHAL, MK
    DEVELOPMENTAL PSYCHOLOGY, 1982, 18 (01) : 17 - 21
  • [8] Multimodal impressions of voice quality settings: the role of vocal and visual symbolism
    Madureira, Sandra
    Fontes, Mario A. S.
    FRONTIERS IN COMMUNICATION, 2023, 8
  • [9] Multimodal Emotion Recognition Using Visual, Vocal and Physiological Signals: A Review
    Udahemuka, Gustave
    Djouani, Karim
    Kurien, Anish M.
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [10] Multimodal Persona Based Generation of Comic Dialogs
    Agrawal, Harsh
    Mishra, Aditya M.
    Gupta, Manish
    Mausam
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14150 - 14164