Predicting Who Will Be the Next Speaker and When in Multi-party Meetings

被引:0
|
作者
Ishii, Ryo
Otsuka, Kazuhiro
Kumano, Shiro
Yamato, Junji
机构
来源
NTT Technical Review | 2015年 / 13卷 / 07期
关键词
Speech processing - Video conferencing;
D O I
暂无
中图分类号
学科分类号
摘要
An understanding of the mechanisms involved in face-to-face communication will contribute to designing advanced video conferencing and dialogue systems. Turn-taking, the situation where the speaker changes, is especially important in multi-party meetings. For smooth turn-taking, the participants need to predict who will start speaking next and to consider a strategy for achieving good timing to speak next. Our aim is to clarify the kinds of behavior that contribute to smooth turn-taking and to develop a model for predicting the next speaker and the start time of the next speaker’s utterance in multi-party meetings. We focus on gaze behavior and respiration near the end of the current speaker’s utterance. We empirically demonstrate that gaze behavior and respiration have a relation to the next speaker and the start timing of the next utterance in multi-party meetings. A prediction model based on the results reveals that gaze behavior and respiration contribute to predicting the next speaker and the timing of the next utterance. © 2015 Nippon Telegraph and Telephone Corp.. All rights reserved.
引用
收藏
相关论文
共 50 条
  • [21] Interpreters' involvement in multi-party interactions: The nature of participation as listener and speaker
    Takimoto, Masato
    MULTILINGUA-JOURNAL OF CROSS-CULTURAL AND INTERLANGUAGE COMMUNICATION, 2012, 31 (01): : 35 - 53
  • [22] Enhanced Speaker-Aware Multi-Party Multi-Turn Dialogue Comprehension
    Ma, Xinbei
    Zhang, Zhuosheng
    Zhao, Hai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2410 - 2423
  • [23] Spatial-aware Speaker Diarizaiton for Multi-channel Multi-party Meeting
    Wang, Jie
    Liu, Yuji
    Wang, Binling
    Zhi, Yiming
    Li, Song
    Xia, Shipeng
    Zhang, Jiayang
    Tong, Feng
    Li, Lin
    Hong, Qingyang
    INTERSPEECH 2022, 2022, : 1491 - 1495
  • [24] Linear Discourse Segmentation of Multi-Party Meetings Based on Local and Global Information
    Bokaei, Mohammad Hadi
    Sameti, Hossein
    Liu, Yang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1879 - 1891
  • [25] Threshold quantum secret sharing between multi-party and multi-party
    YANG YuGuang1
    2 State Key Laboratory of Integrated Services Network
    3 State Key Laboratory of Information Security (Graduate University of Chinese Academy of Sciences)
    4 State Key Laboratory of Networking and Switching Technology
    Science China(Physics,Mechanics & Astronomy), 2008, (09) : 1308 - 1315
  • [26] Effective Speaker Tracking Strategies for Multi-party Human-Computer Dialogue
    Popescu, Vladimir
    Burileanu, Corneliu
    Caelen, Jean
    INTELLIGENT SYSTEMS AND TECHNOLOGIES: METHODS AND APPLICATIONS, 2009, 217 : 193 - +
  • [27] Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
    Meng, Zhao
    Mou, Lili
    Jin, Zhi
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8121 - 8122
  • [28] Towards Neural Speaker Modeling in Multi-Party Conversation: The Task, Dataset, and Models
    Meng, Zhao
    Mou, Lili
    Jin, Zhi
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3142 - 3145
  • [29] Estimating the dominant person in multi-party conversations using speaker diarization strategies
    Hung, Hayley
    Huang, Yan
    Friedland, Gerald
    Gatica-Perez, Daniel
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 2197 - +
  • [30] Threshold quantum secret sharing between multi-party and multi-party
    YuGuang Yang
    QiaoYan Wen
    Science in China Series G: Physics, Mechanics and Astronomy, 2008, 51 : 1308 - 1315