An ensemble model for classifying idioms and literal texts using BERT and RoBERTa

被引:68
|
作者
Briskilal, J. [1 ]
Subalalitha, C. N. [1 ]
机构
[1] SRM Inst Sci & Technol, Chengalpattu, Tamil Nadu, India
关键词
BERT; RoBERTa; Ensemble model; Idiom; Literal classification;
D O I
10.1016/j.ipm.2021.102756
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An idiom is a common phrase that means something other than its literal meaning. Detecting idioms automatically is a serious challenge in natural language processing (NLP) domain appli-cations like information retrieval (IR), machine translation and chatbot. Automatic detection of Idioms plays an important role in all these applications. A fundamental NLP task is text classi-fication, which categorizes text into structured categories known as text labeling or categoriza-tion. This paper deals with idiom identification as a text classification task. Pre-trained deep learning models have been used for several text classification tasks; though models like BERT and RoBERTa have not been exclusively used for idiom and literal classification. We propose a pre-dictive ensemble model to classify idioms and literals using BERT and RoBERTa, fine-tuned with the TroFi dataset. The model is tested with a newly created in house dataset of idioms and literal expressions, numbering 1470 in all, and annotated by domain experts. Our model outperforms the baseline models in terms of the metrics considered, such as F-score and accuracy, with a 2% improvement in accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
    Muneer, Amgad
    Alwadain, Ayed
    Ragab, Mohammed Gamal
    Alqushaibi, Alawi
    INFORMATION, 2023, 14 (08)
  • [42] Heterogeneous Student Knowledge Distillation From BERT Using a Lightweight Ensemble Framework
    Lin, Ching-Sheng
    Tsai, Chung-Nan
    Jwo, Jung-Sing
    Lee, Cheng-Hsiung
    Wang, Xin
    IEEE ACCESS, 2024, 12 : 33079 - 33088
  • [43] Teaching Metaphors and Idioms Using Conceptual Key Model
    Caliskan, Nihal
    BILIG, 2013, (64) : 95 - 122
  • [44] Sentiment analysis of imbalanced datasets using BERT and ensemble stacking for deep learning
    Habbat, Nassera
    Nouri, Hicham
    Anoun, Houda
    Hassouni, Larbi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [45] Classifying ductal trees using geometrical features and ensemble learning techniques
    Skoura, A. (skoura@ceid.upatras.gr), 1600, Springer Verlag (384):
  • [46] Detection of Web Attacks Using the BERT Model
    Seyyar, Yunus Emre
    Yavuz, Ali Gokhan
    Unver, Halil Murat
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [47] Arabic Sentiment Analysis Using BERT Model
    Chouikhi, Hasna
    Chniter, Hamza
    Jarray, Fethi
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 1463 : 621 - 632
  • [48] Classifying Ductal Trees Using Geometrical Features and Ensemble Learning Techniques
    Skoura, Angeliki
    Nuzhnaya, Tatyana
    Bakic, Predrag R.
    Megalooikonomou, Vasilis
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PT II, 2013, 384 : 146 - 155
  • [49] BERT-Based Logits Ensemble Model for Gender Bias and Hate Speech Detection
    Yun, Sanggeon
    Kang, Seungshik
    Kim, Hyeokman
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (05): : 641 - 651
  • [50] BERT-Based Ensemble Model for Statute Law Retrieval and Legal Information Entailment
    Shao, Hsuan-Lei
    Chen, Yi-Chia
    Huang, Sieh-Chuen
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, JSAI-ISAI 2020, 2021, 12758 : 226 - 239