Enhanced distance-aware self-attention and multi-level match for sentence semantic matching

被引:6
|
作者
Deng, Yao [1 ]
Li, Xianfeng [1 ]
Zhang, Mengyan [1 ]
Lu, Xin [1 ]
Sun, Xia [1 ]
机构
[1] Northwest Univ, Sch Informat Sci & Technol, Xian 710127, Peoples R China
关键词
Sentence Matching; Distance-aware Self-attention; Multi-level Match; Representation learning; NETWORK;
D O I
10.1016/j.neucom.2022.05.103
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentence semantic matching is a core research area in natural language processing, which is widely used in various natural language tasks. In recent years, attention mechanism has shown good performance in deep neural networks for sentence semantic matching. Most of the attention-based deep neural networks focus on sentences interaction which ignore modeling the core semantic of the sentence. In other words, they do not consider the importance of the relative distance of words when modeling the sentence semantics, which leads to deviations in modeling the core semantics of the sentence and unstable sentence interaction. Usually, people tend to associate words that are relatively close together when they read and believe that there is a deeper connection between them. Besides, the current interactive matching method after sentence modeling is relatively simple and it may be inadequate. In this paper, we build a well-performed distance-aware self-attention and multi-level matching model (DSSTM) for sentence semantic matching tasks. By considering the importance of different distance tokens, it can get the better original semantics of sentences and hold interactive matching method in multiple level after sentence modeling. To be specific, given two input sentences, we first encode them as contextual embeddings. Then, the contextual embeddings are handled by enhanced distance-aware self-attention to further strengthen the sentence semantic modeling from the whole and local aspect. At the same time, we apply the co-attention layer to extract cross-sentence interaction features while simplifying all the remaining components. Finally, we fuse them into the multi-level matching function to obtain the aggregation vector and learn divers matching representations, which is helpful to capture the diversity of sentence pairs. We conduct experiments on three sentence semantic matching tasks. Experimental results on these public datasets demonstrate that our model outperforms competitive baseline methods and our model has fewer parameters. Our source code is publicly available at https://github.com/xiaodeng-1/DSSTM.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:174 / 187
页数:14
相关论文
共 50 条
  • [21] Building Change Detection in Remote Sensing Imagery with Focal Self-Attention and Multi-Level Feature Fusion
    Shen, Peiquan
    Mei, Liye
    Ye, Zhaoyi
    Wang, Ying
    Zhang, Qi
    Hong, Bo
    Yin, Xiliang
    Yang, Wei
    ELECTRONICS, 2023, 12 (13)
  • [22] Chinese sentence semantic matching based on multi-level relevance extraction and aggregation for intelligent human-robot interaction
    Lu, Wenpeng
    Zhao, Pengyu
    Li, Yifeng
    Wang, Shoujin
    Huang, Heyan
    Shi, Shumin
    Wu, Hao
    APPLIED SOFT COMPUTING, 2022, 131
  • [23] Structure-aware person search with self-attention and online instance aggregation matching
    Gao, Cunyuan
    Yao, Rui
    Zhao, Jiaqi
    Zhou, Yong
    Hu, Fuyuan
    Li, Leida
    NEUROCOMPUTING, 2019, 369 : 29 - 38
  • [24] Multi-query and multi-level enhanced network for semantic segmentation
    Xie, Bin
    Cao, Jiale
    Anwer, Rao Muhammad
    Xie, Jin
    Nie, Jing
    Yang, Aiping
    Pang, Yanwei
    PATTERN RECOGNITION, 2024, 156
  • [25] Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation
    Zhou, Yan
    Zhou, Haibin
    Yang, Yin
    Li, Jianxun
    Irampaye, Richard
    Wang, Dongli
    Zhang, Zhengpeng
    VISUAL COMPUTER, 2024, : 3109 - 3128
  • [26] A semantic-augmented multi-level matching model of Web services
    Liu M.
    Gao Q.
    Shen W.
    Hao Q.
    Yan J.
    Service Oriented Computing and Applications, 2009, 3 (3) : 205 - 215
  • [27] A multi-level matching framework for Semantic Web Services in collaborative design
    Liu, Min
    Shen, Weiming
    Hao, Qi
    Yan, Junwei
    Gao, Qi
    PROCEEDINGS OF THE 2008 12TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOLS I AND II, 2008, : 392 - +
  • [28] Image Captioning with multi-level similarity-guided semantic matching
    Li, Jiesi
    Xu, Ning
    Nie, Weizhi
    Zhang, Shenyuan
    VISUAL INFORMATICS, 2021, 5 (04): : 41 - 48
  • [29] Multi-layered self-attention mechanism for weakly supervised semantic segmentation
    Yaganapu, Avinash
    Kang, Mingon
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 239
  • [30] CoMaSa:Context Multi-aware Self-attention for emotional response generation
    Zheng, Xin
    Du, Yajun
    Qin, Xiaolin
    NEUROCOMPUTING, 2025, 611