Bridging the Gap Between Relevance Matching and Semantic Matching for Short Text Similarity Modeling

被引:0
|
作者
Rao, Jinfeng
Liu, Linqing [1 ]
Tay, Yi [2 ]
Yang, Wei [1 ]
Shi, Peng [1 ]
Lin, Jimmy [1 ]
机构
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON, Canada
[2] Nanyang Technol Univ, Singapore, Singapore
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A core problem of information retrieval (IR) is relevance matching, which is to rank documents by relevance to a user's query. On the other hand, many NLP problems, such as question answering and paraphrase identification, can be considered variants of semantic matching, which is to measure the semantic distance between two pieces of short texts. While at a high level both relevance and semantic matching require modeling textual similarity, many existing techniques for one cannot be easily adapted to the other. To bridge this gap, we propose a novel model, HCAN (Hybrid Co-Attention Network), that comprises (1) a hybrid encoder module that includes ConvNet-based and LSTM-based encoders, (2) a relevance matching module that measures soft term matches with importance weighting at multiple granularities, and (3) a semantic matching module with co-attention mechanisms that capture context-aware semantic relatedness. Evaluations on multiple IR and NLP benchmarks demonstrate state-of-the-art effectiveness compared to approaches that do not exploit pretraining on external data. Extensive ablation studies suggest that relevance and semantic matching signals are complementary across many problem settings, regardless of the choice of underlying encoders.
引用
收藏
页码:5370 / 5381
页数:12
相关论文
共 50 条
  • [1] Interpreting similarity measures: Bridging the gap between schema matching and data integration
    Gal, Avigdor
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 345 - 352
  • [2] An Semantic Similarity Matching Method for Chinese Medical Question Text
    Wang, Liru
    Zhang, Tongxuan
    Tian, Jiewen
    Lin, Hongfei
    HEALTH INFORMATION PROCESSING, CHIP 2022, 2023, 1772 : 82 - 94
  • [3] A Fast Matching Method Based on Semantic Similarity for Short Texts
    Xu, Jiaming
    Liu, Pengcheng
    Wu, Gaowei
    Sun, Zhengya
    Xu, Bo
    Hao, Hongwei
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 299 - 309
  • [4] Enhanced Semantic Similarity Learning Framework for Image-Text Matching
    Zhang, Kun
    Hu, Bo
    Zhang, Huatian
    Li, Zhe
    Mao, Zhendong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2973 - 2988
  • [5] CSRS: Code Search with Relevance Matching and Semantic Matching
    Cheng, Yi
    Kuang, Li
    30TH IEEE/ACM INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2022), 2022, : 533 - 542
  • [6] A Short Text Matching Model Incorporating Contextual Semantic Differences
    Zhang W.
    Wang M.
    Hou Z.
    Beijing Daxue Xuebao (Ziran Kexue Ban)/Acta Scientiarum Naturalium Universitatis Pekinensis, 2023, 59 (01): : 30 - 38
  • [7] Semantic similarity between sentences through approximate tree matching
    Ribadas, FJ
    Vilares, M
    Vilares, J
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3523 : 638 - 646
  • [8] Text matching to measure patent similarity
    Arts, Sam
    Cassiman, Bruno
    Carlos Gomez, Juan
    STRATEGIC MANAGEMENT JOURNAL, 2018, 39 (01) : 62 - 84
  • [9] SEMANTIC SIMILARITY MODELING BASED ON MULTI-GRANULARITY INTERACTION MATCHING
    Li, Xu
    Yao, Chunlong
    Zhang, Qinyang
    Zhang, Guoqi
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (05): : 1685 - 1700
  • [10] Towards Explainable Semantic Text Matching
    Landthaler, Joerg
    Glaser, Ingo
    Matthes, Florian
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS (JURIX 2018), 2018, 313 : 200 - 204