Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task

被引:0
|
作者
Laskar, Md Tahmid Rahman [1 ,3 ]
Huang, Jimmy [2 ,3 ]
Hoque, Enamul [2 ]
机构
[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada
[2] York Univ, Sch Informat Technol, Toronto, ON, Canada
[3] York Univ, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Answer Selection; Transformer Encoder; Contextualized Embeddings; ELMo; BERT; RoBERTa; Deep Learning;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Word embeddings that consider context have attracted great attention for various natural language processing tasks in recent years. In this paper, we utilize contextualized word embeddings with the transformer encoder for sentence similarity modeling in the answer selection task. We present two different approaches (feature-based and fine-tuning-based) for answer selection. In the feature-based approach, we utilize two types of contextualized embeddings, namely the Embeddings from Language Models (ELMo) and the Bidirectional Encoder Representations from Transformers (BERT) and integrate each of them with the transformer encoder. We find that integrating these contextual embeddings with the transformer encoder is effective to improve the performance of sentence similarity modeling. In the second approach, we fine-tune two pre-trained transformer encoder models for the answer selection task. Based on our experiments on six datasets, we find that the fine-tuning approach outperforms the feature-based approach on all of them. Among our fine-tuning-based models, the Robustly Optimized BERT Pretraining Approach (RoBERTa) model results in new state-of-the-art performance across five datasets.
引用
收藏
页码:5505 / 5514
页数:10
相关论文
共 50 条
  • [31] Auto-Scoring Feature Based on Sentence Transformer Similarity Check with Korean Sentences Spoken by Foreigners
    Wahyutama, Aria Bisma
    Hwang, Mintae
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [32] Submodularity-Inspired Data Selection for Goal-Oriented Chatbot Training Based on Sentence Embeddings
    Dimovski, Mladen
    Musat, Claudiu
    Ilievski, Vladimir
    Hossmann, Andreea
    Baeriswyl, Michael
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4019 - 4025
  • [33] Sentence Pair Similarity Modeling Based on Weighted Interaction of Multi-semantic Embedding Matrix
    Chen, Junyu
    Zhu, Xiaohong
    Sang, Jun
    Gong, Lu
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 1118 - 1123
  • [34] GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks
    Ma, Weicheng
    Lou, Renze
    Zhang, Kai
    Wang, Lili
    Vosoughi, Soroush
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 5621 - 5632
  • [35] Semantic Interest Modeling and Content-Based Scientific Publication Recommendation Using Word Embeddings and Sentence Encoders
    Guesmi, Mouadh
    Chatti, Mohamed Amine
    Kadhim, Lamees
    Joarder, Shoeb
    Ain, Qurat Ul
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2023, 7 (09)
  • [36] Bearing life prognosis based on monotonic feature selection and similarity modeling
    Niu, Gang
    Qian, Fang
    Choi, Byeong-Keun
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2016, 230 (18) : 3183 - 3193
  • [37] Conditional checkpoint selection strategy based on sentence structures for text to triple translation using BiLSTM encoder-decoder model
    Shrivastava, Manu
    Shibata, Kosei
    Wagatsuma, Hiroaki
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [38] Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis
    Zhou, Xiao
    Ling, Zhen-Hua
    Zhou, Zhi-Ping
    Dai, Li-Rong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2509 - 2513
  • [39] Vision Intelligence Assisted Lung Function Estimation Based on Transformer Encoder-Decoder Network with Invertible Modeling
    Chen L.
    Lu D.
    Zhai J.
    Cai K.
    Wang L.
    Zhang Z.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (07): : 1 - 14
  • [40] Efficient Transformer-Based Compressed Video Modeling via Informative Patch Selection
    Suzuki, Tomoyuki
    Aoki, Yoshimitsu
    SENSORS, 2023, 23 (01)