Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers

被引:0
|
作者
Islam, Mazharul [1 ]
Anjum, Aftab [1 ]
Ahsan, Tanveer [2 ]
Wang, Lin [1 ]
机构
[1] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China
[2] Int Islamic Univ, Comp Sci & Engn, Kumira 4318, Chittagong, Bangladesh
基金
中国国家自然科学基金;
关键词
sentiment classification; dimensionality reduction; feature reduction; term presence count; term presence ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis intends to identify the opinion either positive or negative given by clients or users from review documents. Sentiment analysis utilizing machine learning strategies faces the issue of high dimensionality of the feature vector. Consequently, a feature reduction strategy is required to dispose of the unessential and noisy elements from the feature vector. Feature reduction techniques selects the prominent features for reducing size of the feature set. The features which are nearly distributed presented by different class in the feature vector, make complexity for the classifier to draw a clear decision boundary. In this work, we proposed two different approaches (i.e., Term Presence Count (TPC) and Term Presence Ratio (TPR)) to remove those redundant features in positively and negatively tagged documents with nearly equal distribution. We applied four machine learning-based classification techniques including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Naive Bayes (NB) for sentiment classification using movie review dataset. Finally, the classifiers are evaluated in terms of accuracy, precision, recall, and Average F-measure. Experimental results manifest that the feature dimension reduced to approximately 83% by our proposed method while improving the classification performance.
引用
收藏
页码:3097 / 3103
页数:7
相关论文
共 50 条
  • [21] Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning
    Zahoor, Kanwal
    Bawany, Narmeen Zakaria
    Hamid, Soomaiya
    2020 21ST INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2020,
  • [22] Coupling NCA Dimensionality Reduction with Machine Learning in Multispectral Rock Classification Problems
    Sinaice, Brian Bino
    Owada, Narihiro
    Saadat, Mahdi
    Toriya, Hisatoshi
    Inagaki, Fumiaki
    Bagai, Zibisani
    Kawamura, Youhei
    MINERALS, 2021, 11 (08)
  • [23] Machine Learning Classification Techniques to Predict Directional Change of Energy Prices Using High Dimensionality Reduction
    Moni, Vidya
    Mattipalli, Maheshwari
    Badar, Altaf Q. H.
    PROCEEDING OF THE 2ND 2022 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSASE 2022), 2022, : 247 - 252
  • [24] Ship Engine Model Selection by Applying Machine Learning Classification Techniques Using Imputation and Dimensionality Reduction
    Skarlatos, Kyriakos
    Papageorgiou, Grigorios
    Biris, Panagiotis
    Skamnia, Ekaterini
    Economou, Polychronis
    Bersimis, Sotirios
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (01)
  • [25] POST-PROCESSING AND DIMENSIONALITY REDUCTION FOR EXTREME LEARNING MACHINE IN TEXT CLASSIFICATION
    Trusca, Maria Mihaela
    Aldea, Anamaria
    Gradinaru, Simona Elena
    Albu, Crisan
    ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH, 2021, 55 (04): : 37 - 50
  • [26] Sentiment Classification for Film Reviews in Gujarati Text Using Machine Learning and Sentiment Lexicons
    Shah, Parita
    Swaminarayan, Priya
    Patel, Maitri
    JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2022, 17 (01) : 1 - 16
  • [27] Interrogating machine learning classifiers and dimensionality reduction techniques for radiomic prediction of glioma tumor grade.
    Wahid, Kareem
    Kotrotsou, Aikaterini
    Abrol, Srishti
    Hassan, Ahmed
    Elshafeey, Nabil
    Colen, Rivka R.
    JOURNAL OF CLINICAL ONCOLOGY, 2018, 36 (15)
  • [28] Dimensionality Reduction Strategies for the Design of Human Machine Interface Signal Classifiers
    Gupta, Lalit
    Kota, Srinivas
    Murali, Swetha
    Molfese, Dennis
    Vaidyanathan, Ravi
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 2431 - +
  • [29] Sentiment classification on product reviews using machine learning and deep learning techniques
    Singh, Neha
    Jaiswal, Umesh Chandra
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (12) : 5726 - 5741
  • [30] Classification of Neurodegenerative Disease Stages using Ensemble Machine Learning Classifiers
    Rohini, M.
    Surendran, D.
    2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 66 - 73