A tweet sentiment classification approach using an ensemble classifier

被引:4
|
作者
KP V. [1 ]
AB R. [1 ]
HL G. [2 ]
Ravi V. [3 ]
Krichen M. [4 ,5 ]
机构
[1] Department of Information Science and Engineering, Vidyavardhaka College of Engineering, Mysuru
[2] Department of Information Technology, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Manipal
[3] Center for Artificial Intelligence, Prince Mohammad Bin Fahd University, Khobar
[4] Department of Information Technology, Faculty of Computer Science and Information Technology (FCSIT), Al-Baha University, Alaqiq
[5] ReDCAD Laboratory, University of Sfax, Sfax
关键词
Adaptive boosting; Ensemble classifier; Sentiment analysis; Tweets; Twitter API;
D O I
10.1016/j.ijcce.2024.04.001
中图分类号
学科分类号
摘要
Social media users are more receptive to products or events and share their thoughts through raw textual data, which is classified as semi-structured data. This data, which is presented using a variety of terminologies, is noisy by nature but yet contains important information and superfluous details, giving analysts a way to identify patterns and knowledge. This hidden information must be extracted from language data in order to make informed decisions and create strategic plans for entering new markets. Among the most prominent fields of study are natural language processing (NLP) and data mining techniques, especially when it comes to sentiment analysis—the process of identifying the feelings and insights concealed in the data. Twitter is one of the significant microblogging platform with millions of users. These users use Twitter to share sentiments using hash tags on different topics and to make status updates known as tweets. Twitter is therefore regarded as a significant real-time source and as one of the most active opinion indicators. The volume of information is produced by Twitter is enormous and manually scanning the entire data set is difficult process. The paper proposed an ensemble classifier to categorize emotion of the tweets on the basis of polarities such as positive and negative. In our study, we ensemble classifiers which is a combination of Random Forest (RF), Support Vector Machine (SVM) and Decision Tree (DT). The data is collected from Twitter API and the Twitter data is analysed autonomously to define public view on particular topic. The features obtained after the process of dimensionality reduction using LDA undergoes the stage of feature selection using Wrapper based technique. The iterative Wrapper based technique predict score for the features, the features with low score are ignored and high score is proceeded for classification. The ensemble classifier used Adaptive Boosting (AdaBoost) technique where the output from the Machine Learning (ML) classifiers are combined to produce a single output. Adaboost combines the poor classifiers and extracts the prediction value to make a better classifier. The experimental results show that the proposed ensemble classifier provides better accuracy of 93.42 % that is comparatively better than existing Convolutional Bidirectional - Long Short-Term Memory (ConvBiLSTM) classifier and Hybrid Lexicon- Naïve Bayes Classifier (HL-NBC) which produce classification accuracy of 91.53 % and 89.61 % respectively. © 2024 The Authors
引用
收藏
页码:170 / 177
页数:7
相关论文
共 50 条
  • [1] A Tweet Sentiment Classification Approach Using a Hybrid Stacked Ensemble Technique
    Gaye, Babacar
    Zhang, Dezheng
    Wulamu, Aziguli
    INFORMATION, 2021, 12 (09)
  • [2] Using Ensemble Learners to Improve Classifier Performance on Tweet Sentiment Data
    Prusa, Joseph
    Khoshgoftaar, Taghi M.
    Dittman, Daivd J.
    2015 IEEE 16TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2015, : 252 - 257
  • [3] Tweet Sentiment Classification by Semantic and Frequency Base Features Using Hybrid Classifier
    Menaria, Hemant Kumar
    Nagar, Pritesh
    Patel, Mayank
    FIRST INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR COMPUTATIONAL INTELLIGENCE, 2020, 1045 : 107 - 123
  • [4] Sentiment classification using hybrid feature selection and ensemble classifier
    Jain, Achin
    Jain, Vanita
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (02) : 659 - 668
  • [5] GBSVM: Sentiment Classification from Unstructured Reviews Using Ensemble Classifier
    Khalid, Madiha
    Ashraf, Imran
    Mehmood, Arif
    Ullah, Saleem
    Ahmad, Maqsood
    Choi, Gyu Sang
    APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [6] Tweet sentiment analysis with classifier ensembles
    da Silva, Nadia F. F.
    Hruschka, Eduardo R.
    Hruschka, Estevam R., Jr.
    DECISION SUPPORT SYSTEMS, 2014, 66 : 170 - 179
  • [7] Using Feature Selection in Combination with Ensemble Learning Techniques to Improve Tweet Sentiment Classification Performance
    Prusa, Joseph D.
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 186 - 193
  • [8] Tweet Sentiment Classification Using an Ensemble of Machine Learning Supervised Classifiers Employing Statistical Feature Selection Methods
    Devi, K. Lakshmi
    Subathra, P.
    Kumar, P. N.
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON FUZZY AND NEURO COMPUTING (FANCCO - 2015), 2015, 415 : 1 - 13
  • [9] Tweet Sentiment: From Classification to Quantification
    Gao, Wei
    Sebastiani, Fabrizio
    PROCEEDINGS OF THE 2015 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2015), 2015, : 97 - 104
  • [10] Precise Tweet Classification and Sentiment Analysis
    Batool, Rabia
    Khattak, Asad Masood
    Maqbool, Jahanzeb
    Lee, Sungyoung
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 461 - 466