MULTIMODAL ADDRESSEE DETECTION IN MULTIPARTY DIALOGUE SYSTEMS

Cited by: 0
Authors
Tsai, T. J. [1]
Stolcke, Andreas [2]
Slaney, Malcolm [2]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Microsoft Res, Mountain View, CA USA
Source
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015
Keywords
addressee detection; dialog system; multimodality; multiparty; human-human-computer;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Addressee detection answers the question, "Are you talking to me?" When multiple users interact with a dialogue system, it is important to know when a user is speaking to the computer and when he or she is speaking to another person. We approach this problem from a multimodal perspective, using lexical, acoustic, visual, dialog state, and beam-forming information. Using data from a multiparty dialogue system, we demonstrate the benefit of using multiple modalities over using a single modality. We also assess the relative importance of the various modalities in predicting the addressee. In our experiments, we find that acoustic features are by far the most important, that ASR and system-state information are useful, and that visual and beamforming features provide little additional benefit. Our study suggests that acoustic, lexical, and system state information are an effective, economical combination of modalities to use in addressee detection.
Pages: 2314 - 2318
Number of pages: 5