Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers

Authors
Karim M. [1]
Missen M.M.S. [1]
Umer M. [1]
Fida A. [1]
Eshmawi A.A. [2]
Mohamed A. [3]
Ashraf I. [4]
Affiliations
[1] Department of Computer Science & Information Technology, Islamia University, Bahawalpur
[2] University of Jeddah, Department of Cybersecurity, College of Computer Science and Engineering, Jeddah
[3] University Research Centre, Future University, Cairo
[4] Information and Communication Engineering, Yeungnam University, Gyeongsan
Keywords
Citation sentiment analysis; Dataset balancing; Machine learning; SMOTE; Term frequency-inverse document frequency
DOI
10.7717/PEERJ-CS.1107
Abstract
Sentiment analysis has been researched extensively during the last few years; however, the sentiment analysis of citations in research articles is an unexplored research area. Sentiment analysis of citations can enable new applications in bibliometrics and provide insights for a better understanding of scientific knowledge. Citation count, as it is used today to measure the quality of a paper, does not reliably reflect the quality of a scientific article, because an article may be cited to point out its weaknesses. Determining the polarity of a citation is therefore an important task for quantifying the quality of the cited article and ascertaining its impact and ranking. This article presents an approach to determine the polarity of the cited article using term frequency-inverse document frequency (TF-IDF) features and machine learning classifiers. To analyze the influence of an imbalanced dataset, several experiments are performed with and without the synthetic minority oversampling technique (SMOTE), using uni-gram and bi-gram TF-IDF features. Results indicate that the proposed methodology achieves an accuracy of 99.0% with the extra tree classifier when trained on the SMOTE-oversampled dataset with bi-gram features. © 2022 Karim et al.
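The two preprocessing steps named in the abstract, n-gram TF-IDF features and SMOTE oversampling, can be illustrated with a minimal pure-Python sketch. This is not the authors' code: the toy citation sentences, the labels, the function names (ngrams, tfidf, smote), and the plain idf = log(N / df) weighting are all assumptions made for illustration. The classifier is omitted; an extra tree classifier (or any other model) would be trained on the balanced X_bal and y_bal.

```python
import math
import random
from collections import Counter

def ngrams(text, n_max=2):
    """Return all 1..n_max-grams of a lowercased, whitespace-split sentence."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n])
            for n in range(1, n_max + 1)
            for i in range(len(tokens) - n + 1)]

def tfidf(docs, n_max=2):
    """Plain TF-IDF (assumed variant): tf = count / doc length, idf = log(N / df)."""
    grams = [ngrams(d, n_max) for d in docs]
    vocab = sorted({g for doc in grams for g in doc})
    df = Counter(g for doc in grams for g in set(doc))  # document frequency
    n_docs = len(docs)
    vectors = []
    for doc in grams:
        counts = Counter(doc)
        size = max(len(doc), 1)  # avoid division by zero on empty input
        vectors.append([counts[g] / size * math.log(n_docs / df[g]) for g in vocab])
    return vocab, vectors

def smote(minority, n_new, k=2, seed=0):
    """SMOTE-style oversampling: each synthetic point is a random interpolation
    between a minority sample and one of its k nearest minority neighbours.
    Requires at least two minority samples."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((m for m in minority if m is not base),
                            key=lambda m: math.dist(base, m))[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic

# Hypothetical citation sentences: 1 = positive citation, 0 = negative citation.
sentences = [
    "this method works well",
    "we build on this strong method",
    "the results are convincing",
    "their results are convincing and robust",
    "this method fails badly",
    "the approach fails on noisy data",
]
labels = [1, 1, 1, 1, 0, 0]

vocab, X = tfidf(sentences, n_max=2)
minority = [x for x, y in zip(X, labels) if y == 0]
n_new = labels.count(1) - labels.count(0)  # synthetic samples needed to balance

X_bal = X + smote(minority, n_new)
y_bal = labels + [0] * n_new
```

After this runs, both classes have the same number of feature vectors, and every synthetic vector lies on the line segment between two real negative-citation vectors. Interpolating in feature space, rather than duplicating minority rows, is what distinguishes SMOTE from simple random oversampling.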
Related papers
  • [1] Comprehension of polarity of articles by citation sentiment analysis using TF-IDF and ML classifiers
    Karim, Musarat
    Missen, Malik Muhammad Saad
    Umer, Muhammad
    Fida, Alisha
    Eshmawi, Ala' Abdulmajid
    Mohamed, Abdullah
    Ashraf, Imran
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [2] Supervised classifiers with TF-IDF features for sentiment analysis of Marathi tweets
    Patil, Rupali S.
    Kolhe, Satish R.
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [3] Supervised classifiers with TF-IDF features for sentiment analysis of Marathi tweets
    Rupali S. Patil
    Satish R. Kolhe
    Social Network Analysis and Mining, 2022, 12
  • [4] Evaluation of the Delta TF-IDF Features for Sentiment Analysis
    Samoylov, Andrew B.
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, 2014, 436 : 207 - 212
  • [5] Sentiment analysis using TF-IDF weighting of UK MPs' tweets on Brexit
    Mee, Alexander
    Homapour, Elmina
    Chiclana, Francisco
    Engel, Ofer
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [6] Sentiment Enhanced Hybrid TF-IDF for Microblogs
    Simsek, Atakan
    Karagoz, Pinar
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 311 - 317
  • [7] Research on Sentiment Analysis of Microblogging Based on LSA and TF-IDF
    Li, Yingying
    Shen, Bo
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2584 - 2588
  • [8] A Sentiment analysis-based hotel recommendation using TF-IDF Approach
    Mishra, Ram Krishn
    Urolagin, Siddhaling
    Jothi, Angel Arul J.
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 811 - 815
  • [9] Emotion Analysis in Text using TF-IDF
    Sundaram, Varun
    Ahmed, Saad
    Muqtadeer, Shaik Abdul
    Reddy, R. Ravinder
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 292 - 297
  • [10] Hybrid classifier for sentiment analysis in Malayalam with modified TF-IDF features
    Pramitha, P. Ambily
    Abraham, John T.
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2023, 14 (05)