Coreference-aware Double-channel Attention Network for Multi-party Dialogue Reading Comprehension

被引:1
|
作者
Li, Yanling [1 ]
Zou, Bowei [2 ]
Fan, Yifan [1 ]
Dong, Mengxing [1 ]
Hong, Yu [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
[2] ASTAR, Inst Infocomm Res, Singapore, Singapore
基金
国家重点研发计划; 美国国家科学基金会;
关键词
Multi-party dialogue reading comprehension; Coreference-aware attention; Utterance profiling; Interaction modeling;
D O I
10.1109/IJCNN54540.2023.10191414
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle Multi-party Dialogue Reading Comprehension (abbr., MDRC). MDRC stands for an extractive reading comprehension task grounded on a batch of dialogues among multiple interlocutors. It is challenging due to the requirement of understanding cross-utterance contexts and relationships in a multi-turn multi-party conversation. Previous studies have made great efforts on the utterance profiling of a single interlocutor and graph-based interaction modeling. The corresponding solutions contribute to the answer-oriented reasoning on a series of well-organized and thread-aware conversational contexts. However, the current MDRC models still suffer from two bottlenecks. On the one hand, a pronoun like "it" most probably produces multiskip reasoning throughout the utterances of different interlocutors. On the other hand, an MDRC encoder is potentially puzzled by fuzzy features, i.e., the mixture of inner linguistic features in utterances and external interactive features among utterances. To overcome the bottlenecks, we propose a coreference-aware attention modeling method to strengthen the reasoning ability. In addition, we construct a two-channel encoding network. It separately encodes utterance profiles and interactive relationships, so as to relieve the confusion among heterogeneous features. We experiment on the benchmark corpora Molweni and FriendsQA. Experimental results demonstrate that our approach yields substantial improvements on both corpora, compared to the fine-tuned BERT and ELECTRA baselines. The maximum performance gain is about 2.5% F1-score. Besides, our MDRC models outperform the state-of-the-art in most cases.
引用
收藏
页数:8
相关论文
共 29 条
  • [21] A novel 3D shape recognition method based on double-channel attention residual network
    Ziping Ma
    Jie Zhou
    Jinlin Ma
    Tingting Li
    Multimedia Tools and Applications, 2022, 81 : 32519 - 32548
  • [22] ClueReader: Heterogeneous Graph Attention Network for Multi-Hop Machine Reading Comprehension
    Gao, Peng
    Gao, Feng
    Wang, Peng
    Ni, Jian-Cheng
    Wang, Fei
    Fujita, Hamido
    ELECTRONICS, 2023, 12 (14)
  • [23] Keywords-aware dynamic graph neural network for multi-hop reading comprehension
    Jia, Meihuizi
    Liao, Lejian
    Wang, Wenjing
    Li, Fei
    Chen, Zhendong
    Li, Jiaqi
    Huang, Heyan
    NEUROCOMPUTING, 2022, 501 : 25 - 40
  • [24] A Modality-Enhanced Multi-Channel Attention Network for Multi-Modal Dialogue Summarization
    Lu, Ming
    Liu, Yang
    Zhang, Xiaoming
    APPLIED SCIENCES-BASEL, 2024, 14 (20):
  • [25] Token-level Dynamic Self-Attention Network for Multi-Passage Reading Comprehension
    Zhuang, Yimeng
    Wang, Huadong
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2252 - 2262
  • [26] Load Prediction in Double-Channel Residual Self-Attention Temporal Convolutional Network with Weight Adaptive Updating in Cloud Computing
    Lin, Jiang
    Guan, Yepeng
    SENSORS, 2024, 24 (10)
  • [27] Double-Branch Multi-attention Mechanism Based Sharpness-Aware Classification Network
    Jiang W.
    Zhao L.
    Tu C.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (03): : 252 - 267
  • [28] Single image super-resolution via global aware external attention and multi-scale residual channel attention network
    Liu, Mingming
    Li, Sui
    Liu, Bing
    Yang, Yuxin
    Liu, Peng
    Zhang, Chen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (06) : 2309 - 2321
  • [29] MSRA-Net: multi-channel semantic-aware and residual attention mechanism network for unsupervised 3D image registration
    Ren, Xiaozhen
    Song, Haoyuan
    Zhang, Zihao
    Yang, Tiejun
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (16):