A multimodal analysis of vocal and visual backchannels in spontaneous dialogs

被引:0
|
作者
Truong, Khiet P. [1 ]
Poppe, Ronald [1 ]
de Kok, Iwan [1 ]
Heylen, Dirk [1 ]
机构
[1] Univ Twente, Enschede, Netherlands
关键词
listener response; backchannel; continuer; prediction; head nod; vocalization; gaze; pitch; FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Backchannels (BCs) are short vocal and visual listener responses that signal attention, interest, and understanding to the speaker. Previous studies have investigated BC prediction in telephone-style dialogs from prosodic cues. In contrast, we consider spontaneous face-to-face dialogs. The additional visual modality allows speaker and listener to monitor each other's attention continuously, and we hypothesize that this affects the BC-inviting cues. In this study, we investigate how gaze, in addition to prosody, can cue BCs. Moreover, we focus on the type of BC performed, with the aim to find out whether vocal and visual BCs are invited by similar cues. In contrast to telephone-style dialogs, we do not find rising/falling pitch to be a BC-inviting cue. However, in a face-to-face setting, gaze appears to cue BCs. In addition, we find that mutual gaze occurs significantly more often during visual BCs. Moreover, vocal BCs are more likely to be timed during pauses in the speaker's speech.
引用
收藏
页码:2984 / 2987
页数:4
相关论文
共 50 条
  • [21] Reasoning Visual Dialogs with Structural and Partial Observations
    Zheng, Zilong
    Wang, Wenguan
    Qi, Siyuan
    Zhu, Song-Chun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3662 - 6671
  • [22] MULTIMODAL ANALySIS Of AN INVENTED STORy: THE CASE Of THE VISUAL ONOMATOPOEIA
    Calil, Eduardo
    Del Re, Alessandra
    REVISTA DA ANPOLL, 2009, 2 (27) : 13 - 41
  • [23] Multimodal visual system analysis as a biomarker of visual hallucinations in Parkinson's disease
    Diez-Cirarda, Maria
    Cabrera-Zubizarreta, Alberto
    Murueta-Goyena, Ane
    Strafella, Antonio P.
    Del Pino, Rocio
    Acera, Marian
    Lucas-Jimenez, Olaia
    Ibarretxe-Bilbao, Naroa
    Tijero, Beatriz
    Gomez-Esteban, Juan Carlos
    Gabilondo, Inigo
    JOURNAL OF NEUROLOGY, 2023, 270 (01) : 519 - 529
  • [24] Developing Speech Dialogs For Multimodal HMIs Using Finite State Machines
    Goronzy, Silke
    Mochales, Raquel
    Beringer, Nicole
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1774 - 1777
  • [25] Dynamic based multi-agent architecture for multimedia multimodal dialogs
    Djenidi, H
    Tadj, C
    Ramdane-Cherif, A
    Levy, N
    IEEE WORKSHOP ON KNOWLEDGE MEDIA NETWORKING, PROCEEDINGS, 2002, : 107 - 113
  • [26] DiapixUK: task materials for the elicitation of multiple spontaneous speech dialogs
    Baker, Rachel
    Hazan, Valerie
    BEHAVIOR RESEARCH METHODS, 2011, 43 (03) : 761 - 770
  • [27] DiapixUK: task materials for the elicitation of multiple spontaneous speech dialogs
    Rachel Baker
    Valerie Hazan
    Behavior Research Methods, 2011, 43 : 761 - 770
  • [28] Deceptive vocal duets and multimodal display in a songbird
    Rek, Pawel
    Magrath, Robert D.
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2017, 284 (1864)
  • [29] The anuran vocal sac: a tool for multimodal signalling
    Starnberger, Iris
    Preininger, Doris
    Hoedl, Walter
    ANIMAL BEHAVIOUR, 2014, 97 : 281 - 288
  • [30] Visual and vocal recognition memory
    Carlson, HB
    Carr, HA
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1938, 23 : 523 - 530