LadRa-Net: Locally Aware Dynamic Reread Attention Net for Sentence Semantic Matching

被引:2
|
作者
Zhang, Kun [1 ,2 ]
Lv, Guangyi [3 ,4 ]
Wu, Le [1 ,2 ]
Chen, Enhong [4 ]
Liu, Qi [4 ]
Wang, Meng [1 ,2 ]
机构
[1] Hefei Univ Technol, Key Lab Knowledge Engn Big Data, Hefei 230029, Anhui, Peoples R China
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei 230029, Anhui, Peoples R China
[3] Lenovo Res, AI Lab, Beijing 100094, Peoples R China
[4] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Encoding; Bit error rate; Convolutional neural networks; Psychology; Nonhomogeneous media; Dynamic reread (DRr) attention; local structure; representation learning; sentence semantic matching; PERIPHERAL-VISION; CONSCIOUSNESS;
D O I
10.1109/TNNLS.2021.3103185
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentence semantic matching requires an agent to determine the semantic relation between two sentences, which is widely used in various natural language tasks, such as natural language inference (NLI) and paraphrase identification (PI). Much recent progress has been made in this area, especially attention-based methods and pretrained language model-based methods. However, most of these methods focus on all the important parts in sentences in a static way and only emphasize how important the words are to the query, inhibiting the ability of the attention mechanism. In order to overcome this problem and boost the performance of the attention mechanism, we propose a novel dynamic reread (DRr) attention, which can pay close attention to one small region of sentences at each step and reread the important parts for better sentence representations. Based on this attention variation, we develop a novel DRr network (DRr-Net) for sentence semantic matching. Moreover, selecting one small region in DRr attention seems insufficient for sentence semantics, and employing pretrained language models as input encoders will introduce incomplete and fragile representation problems. To this end, we extend DRr-Net to locally aware dynamic reread attention net (LadRa-Net), in which local structure of sentences is employed to alleviate the shortcoming of byte-pair encoding (BPE) in pretrained language models and boost the performance of DRr attention. Extensive experiments on two popular sentence semantic matching tasks demonstrate that DRr-Net can significantly improve the performance of sentence semantic matching. Meanwhile, LadRa-Net is able to achieve better performance by considering the local structures of sentences. In addition, it is exceedingly interesting that some discoveries in our experiments are consistent with some findings of psychological research.
引用
收藏
页码:853 / 866
页数:14
相关论文
共 50 条
  • [1] DRr-Net: Dynamic Re-Read Network for Sentence Semantic Matching
    Zhang, Kun
    Lv, Guangyi
    Wang, Linyuan
    Wu, Le
    Chen, Enhong
    Wu, Fangzhao
    Xie, Xing
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7442 - 7449
  • [2] SAB Net: A Semantic Attention Boosting Framework for Semantic Segmentation
    Ding, Xiaofeng
    Shen, Chaomin
    Zeng, Tieyong
    Peng, Yaxin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022,
  • [3] Bi-directional attention comparison for semantic sentence matching
    Lai, Huiyuan
    Tao, Yizheng
    Wang, Chunliu
    Xu, Lunfan
    Tang, Dingyong
    Li, Gongliang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14609 - 14624
  • [4] Bi-directional attention comparison for semantic sentence matching
    Huiyuan Lai
    Yizheng Tao
    Chunliu Wang
    Lunfan Xu
    Dingyong Tang
    Gongliang Li
    Multimedia Tools and Applications, 2020, 79 : 14609 - 14624
  • [5] PS2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation
    Zhao, Na
    Chua, Tat-Seng
    Lee, Gim Hee
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 723 - 730
  • [6] Enhanced distance-aware self-attention and multi-level match for sentence semantic matching
    Deng, Yao
    Li, Xianfeng
    Zhang, Mengyan
    Lu, Xin
    Sun, Xia
    NEUROCOMPUTING, 2022, 501 : 174 - 187
  • [7] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Rajamani, Kumar T.
    Rani, Priya
    Siebert, Hanna
    ElagiriRamalingam, Rajkumar
    Heinrich, Mattias P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 981 - 989
  • [8] Attention-augmented U-Net (AA-U-Net) for semantic segmentation
    Kumar T. Rajamani
    Priya Rani
    Hanna Siebert
    Rajkumar ElagiriRamalingam
    Mattias P. Heinrich
    Signal, Image and Video Processing, 2023, 17 : 981 - 989
  • [9] Attention-aware semantic relevance predicting Chinese sentence reading
    Sun, Kun
    Liu, Haitao
    COGNITION, 2025, 255
  • [10] Context-Aware Semantic Matching with Self Attention Mechanism
    Chen, Yanmin
    Wang, Hao
    Sun, Ruijun
    Chen, Enhong
    2022 5th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2022, 2022, : 1007 - 1011