OLAWSDS: An Online Arabic Web Spam Detection System

被引:0
|
作者
Al-Kabi, Mohammed N. [1 ]
Wahsheh, Heider A. [2 ]
Alsmadi, Izzat M. [3 ]
机构
[1] Zarqa Univ, Fac Sci & IT, Zarqa, Jordan
[2] King Khalid Univ, Coll Comp Sci, Dept Comp Sci, Abha, Saudi Arabia
[3] Prince Sultan Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11586, Saudi Arabia
关键词
Arabic Web spam; content-based; link-based; Information Retrieval;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For marketing purposes, Some Websites designers and administrators use illegal Search Engine Optimization (SEO) techniques to optimize the ranking of their Web pages and mislead the search engines. Some Arabic Web pages use both content and link features, to increase artificially the rank of their Web pages in the Search Engine Results Pages (SERPs). This study represents an enhancement to previous work in this field. It includes the design and implementation of an online Arabic Web spam detection system, based on algorithms and mathematical foundations, which can detect the Arabic content and link web spam depending on the tree of the spam detection conditions, beside depending on the user's feedback through a custom Web browser. The users can participate in making the decision about any Web page, through their feedbacks, so they judge if the Arabic Web pages in the browser are relevant for their particular queries or not. The proposed system uses the extracted content and link features from Arabic Web pages to determine whether to label each Web page as a spam or as a nonspam. This system also attempts to learn from the user's feedback to enhance automatically its performance. Statistical analysis is adopted in this study to evaluate the proposed system. Statistical Package for the Social Sciences (SPSS) software is used to evaluate this new system which considers the users feedbacks as dependent variables, while Arabic content and links features on the other hand are considered independent variables. The statistical analysis with the SPSS is used to apply a variety of tests, such as the test of the analysis of variance (ANOVA). ANOVA is used to show the relationships between the dependent and independent variables in the dataset, which leads to solving problems and building intelligent decisions and results.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [1] Oases: An Online Scalable Spam Detection System for Social Networks
    Xu, Hailu
    Hu, Liting
    Liu, Pinchao
    Xiao, Yao
    Wang, Wentao
    Dayal, Jai
    Wang, Qingyang
    Tang, Yuzhe
    PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 98 - 105
  • [2] Collective Hyping Detection System for Identifying Online Spam Activities
    Zhang, Qinzhe
    Wu, Jia
    Zhang, Peng
    Long, Guodong
    Zhang, Chengqi
    IEEE INTELLIGENT SYSTEMS, 2017, 32 (05) : 53 - 63
  • [3] LOHIT: An online detection & control system for cellular SMS spam
    Dixit, Siddharth
    Gupta, Sandeep
    Ravishankar, Chinya V.
    Proceedings of the IASTED International Conference on Communication, Network, and Information Security, 2005, : 48 - 54
  • [4] Content-based analysis to detect Arabic web spam
    Al-Kabi, Mohammed
    Wahsheh, Heider
    Alsmadi, Izzat
    Al-Shawakfa, Emad
    Wahbeh, Abdullah
    Al-Hmoud, Ahmed
    JOURNAL OF INFORMATION SCIENCE, 2012, 38 (03) : 284 - 296
  • [5] Exploiting the Spam Correlations in Scalable Online Social Spam Detection
    Xu, Hailu
    Hu, Liting
    Liu, Pinchao
    Guan, Boyuan
    CLOUD COMPUTING - CLOUD 2019, 2019, 11513 : 146 - 160
  • [6] SPAR: A System to Detect Spam in Arabic Opinions
    Wahsheh, Heider A.
    Al-Kabi, Mohammed N.
    Alsmadi, Izzat M.
    2013 IEEE JORDAN CONFERENCE ON APPLIED ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (AEECT), 2013,
  • [7] Harnessing the Nature of Spam in Scalable Online Social Spam Detection
    Xu, Hailu
    Guan, Boyuan
    Liu, Pinchao
    Escudero, William
    Hu, Liting
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3733 - 3736
  • [8] Web spam detection with feature fusion
    Geng, Guanggang
    Zhu, Pengfei
    Wang, Deliang
    Journal of Computational Information Systems, 2009, 5 (03): : 1511 - 1519
  • [9] Novel Features for Web Spam Detection
    Kumar, Santosh
    Gao, Xiaoying
    Welch, Ian
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 593 - 597
  • [10] Web spam detection with learning to rank
    Liu, Yang
    Zhang, Huaxiang
    Liu, Li
    Journal of Computational Information Systems, 2013, 9 (24): : 10005 - 10012