Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task

被引:0
|
作者
Laskar, Md Tahmid Rahman [1 ,3 ]
Huang, Jimmy [2 ,3 ]
Hoque, Enamul [2 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[2] York Univ, Sch Informat Technol, Toronto, ON, Canada
[3] York Univ, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Answer Selection; Transformer Encoder; Contextualized Embeddings; ELMo; BERT; RoBERTa; Deep Learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Word embeddings that consider context have attracted great attention for various natural language processing tasks in recent years. In this paper, we utilize contextualized word embeddings with the transformer encoder for sentence similarity modeling in the answer selection task. We present two different approaches (feature-based and fine-tuning-based) for answer selection. In the feature-based approach, we utilize two types of contextualized embeddings, namely the Embeddings from Language Models (ELMo) and the Bidirectional Encoder Representations from Transformers (BERT) and integrate each of them with the transformer encoder. We find that integrating these contextual embeddings with the transformer encoder is effective to improve the performance of sentence similarity modeling. In the second approach, we fine-tune two pre-trained transformer encoder models for the answer selection task. Based on our experiments on six datasets, we find that the fine-tuning approach outperforms the feature-based approach on all of them. Among our fine-tuning-based models, the Robustly Optimized BERT Pretraining Approach (RoBERTa) model results in new state-of-the-art performance across five datasets.
引用
收藏
页码:5505 / 5514
页数:10
相关论文
共 50 条
  • [41] MUSIPER: a system for modeling music similarity perception based on objective feature subset selection
    Sotiropoulos, Dionysios N.
    Lampropoulos, Aristomenis S.
    Tsihrintzis, George A.
    USER MODELING AND USER-ADAPTED INTERACTION, 2008, 18 (04) : 315 - 348
  • [42] MUSIPER: a system for modeling music similarity perception based on objective feature subset selection
    Dionysios N. Sotiropoulos
    Aristomenis S. Lampropoulos
    George A. Tsihrintzis
    User Modeling and User-Adapted Interaction, 2008, 18 : 315 - 348
  • [43] Intelligent question and answer system for building information modeling and artificial intelligence of things based on the bidirectional encoder representations from transformers model
    Lin, Tzu-Hsuan
    Huang, Yu-Hua
    Putranto, Alan
    AUTOMATION IN CONSTRUCTION, 2022, 142
  • [44] Improved best prediction mode(s) selection methods based on structural similarity in H.264 I-frame encoder
    Mai, ZY
    Yang, CL
    Xie, SL
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 2673 - 2678
  • [45] Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis
    Zhou, Xiao
    Ling, Zhen-Hua
    Dai, Li-Rong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (03)
  • [46] Transformer-based modeling of Clonal Selection and Expression Dynamics reveals resistance mechanisms in breast cancer
    Maulding, Nathan D.
    Zou, Jun
    Zhou, Wei
    Metcalfe, Ciara
    Stuart, Joshua M.
    Ye, Xin
    Hafner, Marc
    NPJ SYSTEMS BIOLOGY AND APPLICATIONS, 2025, 11 (01)
  • [47] CSECU-DSG at SemEval-2022 Task 11: Identifying the Multilingual Complex Named Entity in Text Using Stacked Embeddings and Transformer based Approach
    Aziz, Abdul
    Hossain, Md. Akram
    Chy, Abu Nowshed
    PROCEEDINGS OF THE 16TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2022, 2022, : 1549 - 1555
  • [48] Multi-task supply-demand prediction and reliability analysis for docked bike-sharing systems via transformer-encoder-based neural processes
    Xu, Meng
    Di, Yining
    Yang, Hai
    Chen, Xiqun
    Zhu, Zheng
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 147
  • [49] MBZUAI-UNAM at SemEval-2024 Task 1: Sentence-CROBI, a Simple Cross-Bi-Encoder-Based Neural Network Architecture for Semantic Textual Relatedness
    Ortiz-Barajas, Jesus-German
    Bel-Enguix, Gemma
    Gomez-Adorno, Helena
    PROCEEDINGS OF THE 18TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2024, 2024, : 1071 - 1079
  • [50] A multi-indicator modeling method for similarity-based residual useful life estimation with two selection processes
    Gu M.
    Chen Y.
    International Journal of System Assurance Engineering and Management, 2018, 9 (5) : 987 - 998