Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers

被引:0
|
作者
Islam, Mazharul [1 ]
Anjum, Aftab [1 ]
Ahsan, Tanveer [2 ]
Wang, Lin [1 ]
机构
[1] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China
[2] Int Islamic Univ, Comp Sci & Engn, Kumira 4318, Chittagong, Bangladesh
基金
中国国家自然科学基金;
关键词
sentiment classification; dimensionality reduction; feature reduction; term presence count; term presence ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis intends to identify the opinion either positive or negative given by clients or users from review documents. Sentiment analysis utilizing machine learning strategies faces the issue of high dimensionality of the feature vector. Consequently, a feature reduction strategy is required to dispose of the unessential and noisy elements from the feature vector. Feature reduction techniques selects the prominent features for reducing size of the feature set. The features which are nearly distributed presented by different class in the feature vector, make complexity for the classifier to draw a clear decision boundary. In this work, we proposed two different approaches (i.e., Term Presence Count (TPC) and Term Presence Ratio (TPR)) to remove those redundant features in positively and negatively tagged documents with nearly equal distribution. We applied four machine learning-based classification techniques including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Naive Bayes (NB) for sentiment classification using movie review dataset. Finally, the classifiers are evaluated in terms of accuracy, precision, recall, and Average F-measure. Experimental results manifest that the feature dimension reduced to approximately 83% by our proposed method while improving the classification performance.
引用
收藏
页码:3097 / 3103
页数:7
相关论文
共 50 条
  • [41] Sentiment Analysis for Arabic Reviews using Machine Learning Classification Algorithms
    Sayed, Awny A.
    Elgeldawi, Enas
    Zaki, Alaa M.
    Galal, Ahmed R.
    PROCEEDINGS OF 2020 INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN COMMUNICATION AND COMPUTER ENGINEERING (ITCE), 2020, : 56 - 63
  • [42] NLP-based clinical text classification and sentiment analyses of complex medical transcripts using transformer model and machine learning classifiers
    Pratiyush Guleria
    Neural Computing and Applications, 2025, 37 (1) : 341 - 366
  • [43] Casting Online Votes: To Predict Offline Results Using Sentiment Analysis by machine learning Classifiers
    Juneja, Pragya
    Ojha, Uma
    2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [44] Dimensionality Reduction Applied to Spam Filtering using Bayesian Classifiers
    Almeida, Tiago A.
    Yamakami, Akebo
    REVISTA BRASILEIRA DE COMPUTACAO APLICADA, 2011, 3 (01): : 16 - 29
  • [45] Sentiment analysis of financial Twitter posts on Twitter with the machine learning classifiers
    Cam, Handan
    Cam, Alper Veli
    Demirel, Ugur
    Ahmed, Sana
    HELIYON, 2024, 10 (01)
  • [46] Twitter Sentiment Analysis with Different Feature Extractors and Dimensionality Reduction using Supervised Learning Algorithms
    Shyamasundar, L. B.
    Rani, Jhansi P.
    2016 IEEE ANNUAL INDIA CONFERENCE (INDICON), 2016,
  • [47] Classification of Testable and Valuable User Stories by using Supervised Machine Learning Classifiers
    Subedi, Ishan Mani
    Singh, Maninder
    Ramasamy, Vijayalakshmi
    Walia, Gursimran Singh
    2021 IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2021), 2021, : 409 - 414
  • [48] Depression Level Classification Using Machine Learning Classifiers Based on Actigraphy Data
    Choi, Jung-Gu
    Ko, Inhwan
    Han, Sanghoon
    IEEE ACCESS, 2021, 9 : 116622 - 116646
  • [49] Ensembles of classifiers based on dimensionality reduction
    Schclar, Alon
    Rokach, Lior
    Amit, Amir
    INTELLIGENT DATA ANALYSIS, 2017, 21 (03) : 467 - 489
  • [50] Discriminative Dimensionality Reduction for the Visualization of Classifiers
    Gisbrecht, Andrej
    Schulz, Alexander
    Hammer, Barbara
    PATTERN RECOGNITION APPLICATIONS AND METHODS, ICPRAM 2013, 2015, 318 : 39 - 56