Coreference-aware Double-channel Attention Network for Multi-party Dialogue Reading Comprehension

Cited by: 1

Authors:

Li, Yanling [1]

Zou, Bowei [2]

Fan, Yifan [1]

Dong, Mengxing [1]

Hong, Yu [1]

Affiliations:

[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China

[2] ASTAR, Inst Infocomm Res, Singapore, Singapore

Source:

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023

Funding:

National Key Research and Development Program of China; National Science Foundation (USA);

Keywords:

Multi-party dialogue reading comprehension; Coreference-aware attention; Utterance profiling; Interaction modeling;

DOI:

10.1109/IJCNN54540.2023.10191414

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We tackle Multi-party Dialogue Reading Comprehension (MDRC), an extractive reading comprehension task grounded in a batch of dialogues among multiple interlocutors. It is challenging because it requires understanding cross-utterance contexts and relationships in a multi-turn, multi-party conversation. Previous studies have made great efforts on the utterance profiling of a single interlocutor and on graph-based interaction modeling. The corresponding solutions contribute to answer-oriented reasoning over a series of well-organized and thread-aware conversational contexts. However, current MDRC models still suffer from two bottlenecks. On the one hand, a pronoun like "it" most probably leads to multi-skip reasoning across the utterances of different interlocutors. On the other hand, an MDRC encoder is potentially confused by fuzzy features, i.e., the mixture of inner linguistic features within utterances and external interactive features among utterances. To overcome these bottlenecks, we propose a coreference-aware attention modeling method to strengthen the reasoning ability. In addition, we construct a two-channel encoding network that separately encodes utterance profiles and interactive relationships, so as to relieve the confusion among heterogeneous features. We experiment on the benchmark corpora Molweni and FriendsQA. Experimental results demonstrate that our approach yields substantial improvements on both corpora over the fine-tuned BERT and ELECTRA baselines. The maximum performance gain is about 2.5% in F1-score. In addition, our MDRC models outperform the state of the art in most cases.


Pages: 8
