Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task

被引:0
|
作者
Laskar, Md Tahmid Rahman [1 ,3 ]
Huang, Jimmy [2 ,3 ]
Hoque, Enamul [2 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[2] York Univ, Sch Informat Technol, Toronto, ON, Canada
[3] York Univ, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Answer Selection; Transformer Encoder; Contextualized Embeddings; ELMo; BERT; RoBERTa; Deep Learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Word embeddings that consider context have attracted great attention for various natural language processing tasks in recent years. In this paper, we utilize contextualized word embeddings with the transformer encoder for sentence similarity modeling in the answer selection task. We present two different approaches (feature-based and fine-tuning-based) for answer selection. In the feature-based approach, we utilize two types of contextualized embeddings, namely the Embeddings from Language Models (ELMo) and the Bidirectional Encoder Representations from Transformers (BERT) and integrate each of them with the transformer encoder. We find that integrating these contextual embeddings with the transformer encoder is effective to improve the performance of sentence similarity modeling. In the second approach, we fine-tune two pre-trained transformer encoder models for the answer selection task. Based on our experiments on six datasets, we find that the fine-tuning approach outperforms the feature-based approach on all of them. Among our fine-tuning-based models, the Robustly Optimized BERT Pretraining Approach (RoBERTa) model results in new state-of-the-art performance across five datasets.
引用
收藏
页码:5505 / 5514
页数:10
相关论文
共 50 条
  • [21] Attention-based encoder-decoder model for answer selection in question answering
    Nie, Yuan-ping
    Han, Yi
    Huang, Jiu-ming
    Jiao, Bo
    Li, Ai-ping
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (04) : 535 - 544
  • [22] Transformer-Based Sequence Modeling Short Answer Assessment Framework
    Sharmila, P.
    Anbananthen, Kalaiarasi Sonai Muthu
    Chelliah, Deisy
    Parthasarathy, S.
    Balasubramaniam, Baarathi
    Lurudusamy, Saravanan Nathan
    HighTech and Innovation Journal, 2024, 5 (03): : 627 - 639
  • [23] Reference-based Weak Supervision for Answer Sentence Selection using Web Data
    Krishnamurthy, Vivek
    Vu, Thuy
    Moschitti, Alessandro
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4294 - 4299
  • [24] Empowering Short Answer Grading: Integrating Transformer-Based Embeddings and BI-LSTM Network
    Gomaa, Wael H.
    Nagib, Abdelrahman E.
    Saeed, Mostafa M.
    Algarni, Abdulmohsen
    Nabil, Emad
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (03)
  • [25] Modeling And Modification Of Converter Transformer Similarity Model Based On Finite Element And Similarity Theory
    Wang, Hao
    Zhang, Li
    Sun, Youliang
    Zhang, Zhuangzhuang
    Wang, Dong
    2022 4TH INTERNATIONAL CONFERENCE ON SMART POWER & INTERNET ENERGY SYSTEMS, SPIES, 2022, : 395 - 401
  • [26] Hierarchical Shared Encoder With Task-Specific Transformer Layer Selection for Emotion-Cause Pair Extraction
    Su, Xinxin
    Huang, Zhen
    Su, Yixin
    Trisedya, Bayu Distiawan
    Dou, Yong
    Zhao, Yunxiang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (04) : 1934 - 1948
  • [27] Multi-modal Automatic Video Segmentation with Sentence Transformer Embeddings and KeyBERT-Based Subtopic Extraction
    Vasuki, M.
    Gangadharan, M. Arun
    Daniel, Jibin Thomas
    Sadashiv, Arjun
    Venugopal, Vivek
    Vekkot, Susmitha
    2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
  • [28] TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
    Wang, Kexin
    Reimers, Nils
    Gurevych, Iryna
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 671 - 688
  • [29] Fast reference frame selection based on content similarity for low complexity HEVC encoder
    Pan, Zhaoqing
    Jin, Peng
    Lei, Jianjun
    Zhang, Yun
    Sun, Xingming
    Kwong, Sam
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 40 : 516 - 524
  • [30] Semantic Aware Answer Sentence Selection using Self-Learning based Domain Adaptation
    Sarkar, Rajdeep
    Dutta, Sourav
    Assem, Haytham
    Arcan, Mihael
    McCrae, John
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3849 - 3857