A tweet sentiment classification approach using an ensemble classifier

被引:4
|
作者
KP V. [1 ]
AB R. [1 ]
HL G. [2 ]
Ravi V. [3 ]
Krichen M. [4 ,5 ]
机构
[1] Department of Information Science and Engineering, Vidyavardhaka College of Engineering, Mysuru
[2] Department of Information Technology, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Manipal
[3] Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar
[4] Department of Information Technology, Faculty of Computer Science and Information Technology (FCSIT), Al-Baha University, Alaqiq
[5] ReDCAD Laboratory, University of Sfax, Sfax
关键词
Adaptive boosting; Ensemble classifier; Sentiment analysis; Tweets; Twitter API;
D O I
10.1016/j.ijcce.2024.04.001
中图分类号
学科分类号
摘要
Social media users are more receptive to products or events and share their thoughts through raw textual data, which is classified as semi-structured data. This data, which is presented using a variety of terminologies, is noisy by nature but yet contains important information and superfluous details, giving analysts a way to identify patterns and knowledge. This hidden information must be extracted from language data in order to make informed decisions and create strategic plans for entering new markets. Among the most prominent fields of study are natural language processing (NLP) and data mining techniques, especially when it comes to sentiment analysis—the process of identifying the feelings and insights concealed in the data. Twitter is one of the significant microblogging platform with millions of users. These users use Twitter to share sentiments using hash tags on different topics and to make status updates known as tweets. Twitter is therefore regarded as a significant real-time source and as one of the most active opinion indicators. The volume of information is produced by Twitter is enormous and manually scanning the entire data set is difficult process. The paper proposed an ensemble classifier to categorize emotion of the tweets on the basis of polarities such as positive and negative. In our study, we ensemble classifiers which is a combination of Random Forest (RF), Support Vector Machine (SVM) and Decision Tree (DT). The data is collected from Twitter API and the Twitter data is analysed autonomously to define public view on particular topic. The features obtained after the process of dimensionality reduction using LDA undergoes the stage of feature selection using Wrapper based technique. The iterative Wrapper based technique predict score for the features, the features with low score are ignored and high score is proceeded for classification. The ensemble classifier used Adaptive Boosting (AdaBoost) technique where the output from the Machine Learning (ML) classifiers are combined to produce a single output. Adaboost combines the poor classifiers and extracts the prediction value to make a better classifier. The experimental results show that the proposed ensemble classifier provides better accuracy of 93.42 % that is comparatively better than existing Convolutional Bidirectional - Long Short-Term Memory (ConvBiLSTM) classifier and Hybrid Lexicon- Naïve Bayes Classifier (HL-NBC) which produce classification accuracy of 91.53 % and 89.61 % respectively. © 2024 The Authors
引用
收藏
页码:170 / 177
页数:7
相关论文
共 50 条
  • [31] Using unsupervised information to improve semi-supervised tweet sentiment classification
    Felipe da Silva, Nadia Felix
    Coletta, Luiz F. S.
    Hruschka, Eduardo R.
    Hruschka, Estevam R., Jr.
    INFORMATION SCIENCES, 2016, 355 : 348 - 365
  • [32] An Ensemble Based Approach for Sentiment Classification in Asian Regional Language
    Shelke, Mahesh B.
    Lee, Jeong Gon
    Samanta, Sovan
    Deshmukh, Sachin N.
    Daulappa, G. Bhalke
    Mannade, Rahul B.
    Sivaraman, Arun Kumar
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (03): : 2457 - 2468
  • [33] Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach
    Salunkhe, Uma R.
    Mali, Suresh N.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELLING AND SECURITY (CMS 2016), 2016, 85 : 725 - 732
  • [34] BUILDING AN ENSEMBLE CLASSIFIER USING ENSEMBLE MARGIN. APPLICATION TO IMAGE CLASSIFICATION
    Guo, Li
    Boukir, Samia
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4492 - 4496
  • [35] A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification
    Onan, Aytug
    Korukoglu, Serdar
    Bulut, Hasan
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 62 : 1 - 16
  • [36] Ensemble feature analysis classifier for sentiment analysis using convolutional neural networks
    Arunasafali, M.
    Suneetha, Chittineni
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228
  • [37] Corpus Creation in Telugu: Sentiment Classification Using Ensemble Approaches
    Chattu K.
    Sumathi D.
    SN Computer Science, 4 (6)
  • [38] Ensemble of Heterogeneous Classifiers for Improving Automated Tweet Classification
    Cui, Renhao
    Agrawal, Gagan
    Ramnath, Rajiv
    Khuc, Vinh
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 1045 - 1052
  • [39] Text Mining of Tweet for Sentiment Classification and Association with Stock Prices
    Urolagin, Siddhaling
    2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), 2017, : 384 - 388
  • [40] Classification of skin disease using ensemble-based classifier
    Thenmozhi, K.
    Babu, M. Rajesh
    INTERNATIONAL JOURNAL OF BIOMEDICAL ENGINEERING AND TECHNOLOGY, 2018, 28 (04) : 377 - 394