A multimodal analysis of vocal and visual backchannels in spontaneous dialogs

Cited by: 0
Authors
Truong, Khiet P. [1]
Poppe, Ronald [1]
de Kok, Iwan [1]
Heylen, Dirk [1]
Affiliations
[1] Univ Twente, Enschede, Netherlands
Keywords
listener response; backchannel; continuer; prediction; head nod; vocalization; gaze; pitch; features
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Backchannels (BCs) are short vocal and visual listener responses that signal attention, interest, and understanding to the speaker. Previous studies have investigated BC prediction from prosodic cues in telephone-style dialogs. In contrast, we consider spontaneous face-to-face dialogs. The additional visual modality allows speaker and listener to monitor each other's attention continuously, and we hypothesize that this affects the BC-inviting cues. In this study, we investigate how gaze, in addition to prosody, can cue BCs. We also focus on the type of BC performed, with the aim of finding out whether vocal and visual BCs are invited by similar cues. Unlike in telephone-style dialogs, we do not find rising or falling pitch to be a BC-inviting cue. In a face-to-face setting, however, gaze appears to cue BCs. In addition, we find that mutual gaze occurs significantly more often during visual BCs, and that vocal BCs are more likely to be timed during pauses in the speaker's speech.
Pages: 2984-2987
Page count: 4