Effect of Various Factors in Context of Feature Selection on Opinion Spam Detection

被引:2
|
作者
Rastogi, Ajay [1 ]
Mehrotra, Monica [1 ]
Ali, Syed Shafat [1 ]
机构
[1] Jamie Millia Islamia, Dept Comp Sci, New Delhi, India
关键词
feature selection; opinion spun; online reviews; classification; filter-based; model-based;
D O I
10.1109/Confluence51648.2021.9377056
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the growing popularity of online reviews, spammers often target specific products or services with the aim to mislead consumers in their purchase decisions. This has opened doors for researchers to study the problem of opinion spam detection. Till date, many effective and efficient solutions have been proposed in this regard using various types of features. However, most of the feature engineering tasks extract thousands of features, which may lead to degrade the performance and increase computation cost involved in many machine learning algorithms. Feature selection methods can greatly improve classification performance along with the reduction in computation cost of model training. In this paper, we investigate the effect of different feature selection techniques on opinion spam detection. For the same, various feature selection methods (filter-based and model-based) with varying number of features have been employed to train four different classification models. In addition, three well-known review datasets from different domains (hotel, doctor and restaurant) and four different types of features, viz., unigram, bigram, part-of-speech frequency count and word embedding, have been used to examine the impact of different factors responsible to improve the performance in opinion spam domain. Our experimental results demonstrate how different factors affect classification performance and cost, which is statistically validated by using Analysis of Variance test.
引用
收藏
页码:778 / 783
页数:6
相关论文
共 50 条
  • [1] Opinion Spam Detection Using Feature Selection
    Patel, Rinki
    Thakkar, Priyank
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 560 - 564
  • [2] A comprehensive survey of various methods in opinion spam detection
    Arvind Mewada
    Rupesh Kumar Dewang
    Multimedia Tools and Applications, 2023, 82 : 13199 - 13239
  • [3] A comprehensive survey of various methods in opinion spam detection
    Mewada, Arvind
    Dewang, Rupesh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13199 - 13239
  • [4] Dynamic Feature Selection for Spam Detection in Twitter
    Karakasli, M. Salih
    Aydin, Muhammed Ali
    Yarkan, Serhan
    Boyaci, Ali
    INTERNATIONAL TELECOMMUNICATIONS CONFERENCE, ITELCON 2017, 2019, 504 : 239 - 250
  • [5] Deceptive opinion spam detection using feature reduction techniques
    Maurya, Sushil Kumar
    Singh, Dinesh
    Maurya, Ashish Kumar
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (03) : 1210 - 1230
  • [6] Deceptive opinion spam detection using feature reduction techniques
    Sushil Kumar Maurya
    Dinesh Singh
    Ashish Kumar Maurya
    International Journal of System Assurance Engineering and Management, 2024, 15 : 1210 - 1230
  • [7] Genetic-based Feature Selection for Spam Detection
    Arani, Seyyed Hossein Seyyedi
    Mozaffari, Saeed
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [8] Spam Detection Using Feature Selection and Parameters Optimization
    Lee, Sang Min
    Kim, Dong Seong
    Kim, Ji Ho
    Park, Jong Sou
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 883 - 888
  • [9] An effective feature selection method for web spam detection
    Asdaghi, Faeze
    Soleimani, Ali
    KNOWLEDGE-BASED SYSTEMS, 2019, 166 : 198 - 206
  • [10] Detection of Spam Using Particle Swarm Optimisation in Feature Selection
    Singh, Surender
    Singh, Ashutosh Kumar
    PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2018, 26 (03): : 1355 - 1371