Dimensionality Reduction for Sentiment Classification using Machine Learning Classifiers

被引:0
|
作者
Islam, Mazharul [1 ]
Anjum, Aftab [1 ]
Ahsan, Tanveer [2 ]
Wang, Lin [1 ]
机构
[1] Univ Jinan, Shandong Prov Key Lab Network Based Intelligent C, Jinan 250022, Peoples R China
[2] Int Islamic Univ, Comp Sci & Engn, Kumira 4318, Chittagong, Bangladesh
基金
中国国家自然科学基金;
关键词
sentiment classification; dimensionality reduction; feature reduction; term presence count; term presence ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis intends to identify the opinion either positive or negative given by clients or users from review documents. Sentiment analysis utilizing machine learning strategies faces the issue of high dimensionality of the feature vector. Consequently, a feature reduction strategy is required to dispose of the unessential and noisy elements from the feature vector. Feature reduction techniques selects the prominent features for reducing size of the feature set. The features which are nearly distributed presented by different class in the feature vector, make complexity for the classifier to draw a clear decision boundary. In this work, we proposed two different approaches (i.e., Term Presence Count (TPC) and Term Presence Ratio (TPR)) to remove those redundant features in positively and negatively tagged documents with nearly equal distribution. We applied four machine learning-based classification techniques including Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and Naive Bayes (NB) for sentiment classification using movie review dataset. Finally, the classifiers are evaluated in terms of accuracy, precision, recall, and Average F-measure. Experimental results manifest that the feature dimension reduced to approximately 83% by our proposed method while improving the classification performance.
引用
收藏
页码:3097 / 3103
页数:7
相关论文
共 50 条
  • [31] Email Spam Classification and Detection using Various Machine Learning Classifiers
    Saraswathi, N.
    Pradeep, S.
    Sathiyavathi, V.
    Sabitha, K.
    Kambattan, K. Rajesh
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [32] NETWORK INTRUSION DETECTION SYSTEMS USING SUPERVISED MACHINE LEARNING CLASSIFICATION AND DIMENSIONALITY REDUCTION TECHNIQUES: A SYSTEMATIC REVIEW
    Ashi, Zein
    Aburashed, Laila
    Al-Qudah, Mahmoud
    Qusef, Abdallah
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2021, 7 (04): : 373 - 390
  • [33] Classification of As, Pb and Cd Heavy Metal Ions Using Square Wave Voltammetry, Dimensionality Reduction and Machine Learning
    Leon-Medina, Jersson X.
    Tibaduiza, Diego A.
    Burgos, Juan C.
    Cuenca, Martha
    Vasquez, Dreidy
    IEEE ACCESS, 2022, 10 : 7684 - 7694
  • [34] Spatial Correlation Preserving EEG Dimensionality Reduction Using Machine Learning
    Gebre-Amlak, Haymanot
    Nguyen, Hoang
    Lowe, Jesse
    Nabulsi, Ala-Addin
    Chu, Narisa Nan
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2583 - 2589
  • [35] Fast and Reliable DDoS Detection using Dimensionality Reduction and Machine Learning
    Ashi, Zein
    Aburashed, Laila
    Al-Fawa'reh, Mohammad
    Qasaimeh, Malek
    INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST-2020), 2020, : 13 - 22
  • [36] Onto-based sentiment classification using Machine Learning Techniques
    Saranya, K.
    Jayanthy, S.
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [37] ChatGPT Tweets Sentiment Analysis Using Machine Learning and Data Classification
    Sabir A.
    Ali H.A.
    Aljabery M.A.
    Informatica (Slovenia), 2024, 48 (07): : 103 - 112
  • [38] Classification of Sentiment Reviews for Indian Railways Using Machine Learning Methods
    Bagga, Manju
    Aggarwa, Ritu
    Arora, Nitika
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 171 - 177
  • [39] Twitter Sentiment Classification Using Machine Learning Techniques for Stock Markets
    Qasem, Mohammed
    Thulasiram, Ruppa
    Thulasiram, Parimala
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 834 - 840
  • [40] Dictionary learning based dimensionality reduction for classification
    Schnass, Karin
    Vandergheynst, Pierre
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 780 - +