A multimodal analysis of vocal and visual backchannels in spontaneous dialogs

被引：0

作者：

Truong, Khiet P. ^{[1
]}

Poppe, Ronald ^{[1
]}

de Kok, Iwan ^{[1
]}

Heylen, Dirk ^{[1
]}

机构：

[1] Univ Twente, Enschede, Netherlands

来源：

12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011年

关键词：

listener response; backchannel; continuer; prediction; head nod; vocalization; gaze; pitch; FEATURES;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Backchannels (BCs) are short vocal and visual listener responses that signal attention, interest, and understanding to the speaker. Previous studies have investigated BC prediction in telephone-style dialogs from prosodic cues. In contrast, we consider spontaneous face-to-face dialogs. The additional visual modality allows speaker and listener to monitor each other's attention continuously, and we hypothesize that this affects the BC-inviting cues. In this study, we investigate how gaze, in addition to prosody, can cue BCs. Moreover, we focus on the type of BC performed, with the aim to find out whether vocal and visual BCs are invited by similar cues. In contrast to telephone-style dialogs, we do not find rising/falling pitch to be a BC-inviting cue. However, in a face-to-face setting, gaze appears to cue BCs. In addition, we find that mutual gaze occurs significantly more often during visual BCs. Moreover, vocal BCs are more likely to be timed during pauses in the speaker's speech.

引用

页码：2984 / 2987

页数：4

共 50 条

[31] Correction to: Multimodal visual system analysis as a biomarker of visual hallucinations in Parkinson’s disease
Maria Diez-Cirarda
Alberto Cabrera-Zubizarreta
Ane Murueta-Goyena
Antonio P. Strafella
Rocio Del Pino
Marian Acera
Olaia Lucas-Jiménez
Naroa Ibarretxe-Bilbao
Beatriz Tijero
Juan Carlos Gómez-Esteban
Iñigo Gabilondo
Journal of Neurology, 2023, 270 : 530 - 530
[32] SPONTANEOUS RECOMBINATIONS OF VOCAL PATTERNS IN PARROTS
TODT, D
NATURWISSENSCHAFTEN, 1975, 62 (08) : 399 - 400
[33] iPoet: interactive painting poetry creation with visual multimodal analysis
Feng, Yingchaojie
Chen, Jiazhou
Huang, Keyu
Wong, Jason K.
Ye, Hui
Zhang, Wei
Zhu, Rongchen
Luo, Xiaonan
Chen, Wei
JOURNAL OF VISUALIZATION, 2022, 25 (03) : 671 - 685
[34] Multimodal Emotion Analysis Based on Visual, Acoustic and Linguistic Features
Koren, Leon
Stipancic, Tomislav
Ricko, Andrija
Orsag, Luka
SOCIAL COMPUTING AND SOCIAL MEDIA: DESIGN, USER EXPERIENCE AND IMPACT, SCSM 2022, PT I, 2022, 13315 : 318 - 331
[35] A multimodal imaging analysis of the etiologies and visual outcomes of Pattern Dystrophy
Chen, Kevin
Jung, Jesse J.
Yannuzzi, Lawrence A.
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2014, 55 (13)
[36] Designing and Evaluating Multimodal Interactions for Facilitating Visual Analysis With Dashboards
Chowdhury, Imran
Moeid, Abdul
Hoque, Enamul
Kabir, Muhammad Ashad
Hossain, Md. Sabir
Islam, Mohammad Mainul
IEEE ACCESS, 2021, 9 (09): : 60 - 71
[37] Orko: Facilitating Multimodal Interaction for Visual Exploration and Analysis of Networks
Srinivasan, Arjun
Stasko, John
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) : 511 - 521
[38] VistaNet: Visual Aspect Attention Network for Multimodal Sentiment Analysis
Quoc-Tuan Truong
Lauw, Hady W.
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 305 - 312
[39] iPoet: interactive painting poetry creation with visual multimodal analysis
Yingchaojie Feng
Jiazhou Chen
Keyu Huang
Jason K. Wong
Hui Ye
Wei Zhang
Rongchen Zhu
Xiaonan Luo
Wei Chen
Journal of Visualization, 2022, 25 : 671 - 685
[40] Quantitative analysis of backchannels uttered by an interviewer during neuropsychological tests
Bailly, Gerard
Elisei, Frederic
Juphard, Alexandra
Moreaud, Olivier
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2905 - 2909

← 1 2 3 4 5 →