Contextualized Embeddings based Transformer Encoder for Sentence Similarity Modeling in Answer Selection Task

被引：0

作者：

Laskar, Md Tahmid Rahman ^{[1
,3
]}

Huang, Jimmy ^{[2
,3
]}

Hoque, Enamul ^{[2
]}

机构：

[1] York Univ, Dept Elect Engn & Comp Sci, Toronto, ON, Canada

[2] York Univ, Sch Informat Technol, Toronto, ON, Canada

[3] York Univ, Informat Retrieval & Knowledge Management Res Lab, Toronto, ON, Canada

来源：

PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Answer Selection; Transformer Encoder; Contextualized Embeddings; ELMo; BERT; RoBERTa; Deep Learning;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Word embeddings that consider context have attracted great attention for various natural language processing tasks in recent years. In this paper, we utilize contextualized word embeddings with the transformer encoder for sentence similarity modeling in the answer selection task. We present two different approaches (feature-based and fine-tuning-based) for answer selection. In the feature-based approach, we utilize two types of contextualized embeddings, namely the Embeddings from Language Models (ELMo) and the Bidirectional Encoder Representations from Transformers (BERT) and integrate each of them with the transformer encoder. We find that integrating these contextual embeddings with the transformer encoder is effective to improve the performance of sentence similarity modeling. In the second approach, we fine-tune two pre-trained transformer encoder models for the answer selection task. Based on our experiments on six datasets, we find that the fine-tuning approach outperforms the feature-based approach on all of them. Among our fine-tuning-based models, the Robustly Optimized BERT Pretraining Approach (RoBERTa) model results in new state-of-the-art performance across five datasets.

引用

页码：5505 / 5514

页数：10

共 50 条

[21] Attention-based encoder-decoder model for answer selection in question answering
Nie, Yuan-ping
Han, Yi
Huang, Jiu-ming
Jiao, Bo
Li, Ai-ping
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (04) : 535 - 544
[22] Transformer-Based Sequence Modeling Short Answer Assessment Framework
Sharmila, P.
Anbananthen, Kalaiarasi Sonai Muthu
Chelliah, Deisy
Parthasarathy, S.
Balasubramaniam, Baarathi
Lurudusamy, Saravanan Nathan
HighTech and Innovation Journal, 2024, 5 (03): : 627 - 639
[23] Reference-based Weak Supervision for Answer Sentence Selection using Web Data
Krishnamurthy, Vivek
Vu, Thuy
Moschitti, Alessandro
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4294 - 4299
[24] Empowering Short Answer Grading: Integrating Transformer-Based Embeddings and BI-LSTM Network
Gomaa, Wael H.
Nagib, Abdelrahman E.
Saeed, Mostafa M.
Algarni, Abdulmohsen
Nabil, Emad
BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (03)
[25] Modeling And Modification Of Converter Transformer Similarity Model Based On Finite Element And Similarity Theory
Wang, Hao
Zhang, Li
Sun, Youliang
Zhang, Zhuangzhuang
Wang, Dong
2022 4TH INTERNATIONAL CONFERENCE ON SMART POWER & INTERNET ENERGY SYSTEMS, SPIES, 2022, : 395 - 401
[26] Hierarchical Shared Encoder With Task-Specific Transformer Layer Selection for Emotion-Cause Pair Extraction
Su, Xinxin
Huang, Zhen
Su, Yixin
Trisedya, Bayu Distiawan
Dou, Yong
Zhao, Yunxiang
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (04) : 1934 - 1948
[27] Multi-modal Automatic Video Segmentation with Sentence Transformer Embeddings and KeyBERT-Based Subtopic Extraction
Vasuki, M.
Gangadharan, M. Arun
Daniel, Jibin Thomas
Sadashiv, Arjun
Venugopal, Vivek
Vekkot, Susmitha
2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
[28] TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning
Wang, Kexin
Reimers, Nils
Gurevych, Iryna
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 671 - 688
[29] Fast reference frame selection based on content similarity for low complexity HEVC encoder
Pan, Zhaoqing
Jin, Peng
Lei, Jianjun
Zhang, Yun
Sun, Xingming
Kwong, Sam
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 40 : 516 - 524
[30] Semantic Aware Answer Sentence Selection using Self-Learning based Domain Adaptation
Sarkar, Rajdeep
Dutta, Sourav
Assem, Haytham
Arcan, Mihael
McCrae, John
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3849 - 3857

← 1 2 3 4 5 →