Sentiment Analysis of Text Reviews Using Lexicon-Enhanced Bert Embedding (LeBERT) Model with Convolutional Neural Network

被引:40
|
作者
Mutinda, James [1 ]
Mwangi, Waweru [2 ]
Okeyo, George [3 ]
机构
[1] Kenya Sch Govt Embu Campus, Dept Res & Consultancy, POB 402, Embu 60100, Kenya
[2] Jomo Kenyatta Univ Agr & Technol, Dept Comp, POB 62000, Nairobi 00200, Kenya
[3] Carnegie Mellon Univ Africa, Coll Engn, Kigali Innovat City, BP 6150, Bumbogo, Kigali, Rwanda
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 03期
关键词
natural language processing; word embeddings; BERT; sentiment analysis; convolutional neural network; sentiment lexicon;
D O I
10.3390/app13031445
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sentiment analysis has become an important area of research in natural language processing. This technique has a wide range of applications, such as comprehending user preferences in ecommerce feedback portals, politics, and in governance. However, accurate sentiment analysis requires robust text representation techniques that can convert words into precise vectors that represent the input text. There are two categories of text representation techniques: lexicon-based techniques and machine learning-based techniques. From research, both techniques have limitations. For instance, pre-trained word embeddings, such as Word2Vec, Glove, and bidirectional encoder representations from transformers (BERT), generate vectors by considering word distances, similarities, and occurrences ignoring other aspects such as word sentiment orientation. Aiming at such limitations, this paper presents a sentiment classification model (named LeBERT) combining sentiment lexicon, N-grams, BERT, and CNN. In the model, sentiment lexicon, N-grams, and BERT are used to vectorize words selected from a section of the input text. CNN is used as the deep neural network classifier for feature mapping and giving the output sentiment class. The proposed model is evaluated on three public datasets, namely, Amazon products' reviews, Imbd movies' reviews, and Yelp restaurants' reviews datasets. Accuracy, precision, and F-measure are used as the model performance metrics. The experimental results indicate that the proposed LeBERT model outperforms the existing state-of-the-art models, with a F-measure score of 88.73% in binary sentiment classification.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Lexicon-Enhanced Attention Network Based on Text Representation for Sentiment Classification
    Li, Wenkuan
    Li, Dongyuan
    Yin, Hongxia
    Zhang, Lindong
    Zhu, Zhenfang
    Liu, Peiyu
    APPLIED SCIENCES-BASEL, 2019, 9 (18):
  • [2] A Lexicon-Enhanced Attention Network for Aspect-Level Sentiment Analysis
    Ren, Zhiying
    Zeng, Guangping
    Chen, Liu
    Zhang, Qingchuan
    Zhang, Chunguang
    Pan, Dingqi
    IEEE ACCESS, 2020, 8 (08): : 93464 - 93471
  • [3] Text Sentiment Analysis based on BERT and Convolutional Neural Networks
    Huang, P.
    Zhu, H. J.
    Zheng, L.
    Wang, Y.
    2021 5TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2021, 2021, : 1 - 7
  • [4] Sentiment Analysis on Customer Reviews using Convolutional Neural Network
    Tammina, Srikanth
    Annareddy, Sunil
    2020 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2020), 2020, : 518 - 523
  • [5] Lexicon-Enhanced Multi-Task Convolutional Neural Network for Emotion Distribution Learning
    Dong, Yuchang
    Zeng, Xueqiang
    AXIOMS, 2022, 11 (04)
  • [6] A fuzzy convolutional neural network for text sentiment analysis
    Tuan-Linh Nguyen
    Kavuri, Swathi
    Lee, Minho
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (06) : 6025 - 6034
  • [7] Lexicon-enhanced sentiment analysis framework using rule-based classification scheme
    Asghar, Muhammad Zubair
    Khan, Aurangzeb
    Ahmad, Shakeel
    Qasim, Maria
    Khan, Imran Ali
    PLOS ONE, 2017, 12 (02):
  • [8] Text classification problems via BERT embedding method and graph convolutional neural network
    Loc Tran
    Lam Pham
    Tuan Tran
    An Mai
    2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021), 2021, : 260 - 264
  • [9] Exploring multilingual reviews for aspect-based sentiment analysis using Lexicon and BERT
    Rana, Muhammad Rizwan Rashid
    Ali, Tariq
    Nawaz, Asif
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2024, 42 (05): : 1469 - 1479
  • [10] Short text sentiment analysis based on convolutional neural network
    Li, Weisen
    Li, Zhiqing
    Fang, Xupeng
    2018 14TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB 2018), 2018, : 291 - 295