Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers

被引:0
|
作者
Islam, Mazharul [1 ]
Anjum, Aftab [1 ]
Ahsan, Tanveer [2 ]
Wang, Lin [1 ]
机构
[1] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China
[2] Int Islamic Univ, Comp Sci & Engn, Kumira 4318, Chittagong, Bangladesh
基金
中国国家自然科学基金;
关键词
sentiment classification; dimensionality reduction; feature reduction; term presence count; term presence ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis intends to identify the opinion either positive or negative given by clients or users from review documents. Sentiment analysis utilizing machine learning strategies faces the issue of high dimensionality of the feature vector. Consequently, a feature reduction strategy is required to dispose of the unessential and noisy elements from the feature vector. Feature reduction techniques selects the prominent features for reducing size of the feature set. The features which are nearly distributed presented by different class in the feature vector, make complexity for the classifier to draw a clear decision boundary. In this work, we proposed two different approaches (i.e., Term Presence Count (TPC) and Term Presence Ratio (TPR)) to remove those redundant features in positively and negatively tagged documents with nearly equal distribution. We applied four machine learning-based classification techniques including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Naive Bayes (NB) for sentiment classification using movie review dataset. Finally, the classifiers are evaluated in terms of accuracy, precision, recall, and Average F-measure. Experimental results manifest that the feature dimension reduced to approximately 83% by our proposed method while improving the classification performance.
引用
收藏
页码:3097 / 3103
页数:7
相关论文
共 50 条
  • [1] Sentiment classification for insider threat identification using metaheuristic optimized machine learning classifiers
    Mladenovic, Djordje
    Antonijevic, Milos
    Jovanovic, Luka
    Simic, Vladimir
    Zivkovic, Miodrag
    Bacanin, Nebojsa
    Zivkovic, Tamara
    Perisic, Jasmina
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [2] Drug Sentiment Analysis using Machine Learning Classifiers
    Uddin, Mohammed Nazim
    Bin Hafiz, Md Ferdous
    Hossain, Sohrab
    Islam, Shah Mohammad Mominul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (01) : 92 - 100
  • [3] Optimization of sentiment analysis using machine learning classifiers
    Singh, Jaspreet
    Singh, Gurvinder
    Singh, Rajinder
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2017, 7
  • [4] An approach to text classification using dimensionality reduction and combination of classifiers
    Jain, G
    Ginwala, A
    Aslandogan, YA
    PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI-2004), 2004, : 564 - 569
  • [5] Leveraging machine learning and dimensionality reduction for sports and exercise sentiment analysis
    Srivastava, Shobhit
    Chakraborty, Chinmay
    Sarkar, Mrinal Kanti
    Measurement: Sensors, 2024, 33
  • [6] A Novel Framework for Sentiment Analysis: Dimensionality Reduction for Machine Learning (DRML)
    Dhamayanthi, N.
    Lavanya, B.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (06) : 777 - 794
  • [7] Sentiment Analysis Using Machine Learning Classifiers: Evaluation of Performance
    Rai, Shamantha B.
    Shetty, Sweekriti M.
    Rai, Prakhyath
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 21 - 25
  • [8] Classification of Sentiment Analysis Using Machine Learning
    Parikh, Satyen M.
    Shah, Mitali K.
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, 2020, 46 : 76 - 86
  • [9] Classification of primary biliary cirrhosis using hybridization of dimensionality reduction and machine learning methods
    Singh, Aman
    Pandey, Babita
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 1, 2016, : 199 - 203
  • [10] Robust and Efficient Classification for Underground Metal Target Using Dimensionality Reduction and Machine Learning
    Wan, Yadong
    Li, Tong
    Wang, Peng
    Duan, Shihong
    Zhang, Chao
    Li, Na
    IEEE ACCESS, 2021, 9 : 7384 - 7401