LadRa-Net: Locally Aware Dynamic Reread Attention Net for Sentence Semantic Matching

Cited by: 2
Authors
Zhang, Kun [1, 2]
Lv, Guangyi [3, 4]
Wu, Le [1, 2]
Chen, Enhong [4]
Liu, Qi [4]
Wang, Meng [1, 2]
Affiliations
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Hefei 230029, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei 230029, Anhui, Peoples R China
[3] Lenovo Res, AI Lab, Beijing 100094, Peoples R China
[4] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semantics; Task analysis; Encoding; Bit error rate; Convolutional neural networks; Psychology; Nonhomogeneous media; Dynamic reread (DRr) attention; local structure; representation learning; sentence semantic matching; PERIPHERAL-VISION; CONSCIOUSNESS
DOI
10.1109/TNNLS.2021.3103185
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Sentence semantic matching requires an agent to determine the semantic relation between two sentences and is widely used in natural language tasks such as natural language inference (NLI) and paraphrase identification (PI). Much recent progress has been made in this area, especially with attention-based methods and pretrained language model-based methods. However, most of these methods attend to all the important parts of a sentence in a static way and only emphasize how important the words are to the query, which limits the ability of the attention mechanism. To overcome this problem and boost the performance of the attention mechanism, we propose a novel dynamic reread (DRr) attention, which pays close attention to one small region of the sentences at each step and rereads the important parts for better sentence representations. Based on this attention variant, we develop a novel DRr network (DRr-Net) for sentence semantic matching. Moreover, selecting only one small region in DRr attention seems insufficient for capturing sentence semantics, and employing pretrained language models as input encoders introduces incomplete and fragile representation problems. To address this, we extend DRr-Net to the locally aware dynamic reread attention net (LadRa-Net), in which the local structure of sentences is employed to alleviate the shortcomings of byte-pair encoding (BPE) in pretrained language models and boost the performance of DRr attention. Extensive experiments on two popular sentence semantic matching tasks demonstrate that DRr-Net can significantly improve the performance of sentence semantic matching. Meanwhile, LadRa-Net achieves better performance by considering the local structures of sentences. In addition, some discoveries in our experiments are consistent with findings of psychological research.
Pages: 853-866
Number of pages: 14