OLAWSDS: An Online Arabic Web Spam Detection System

被引:0
|
作者
Al-Kabi, Mohammed N. [1 ]
Wahsheh, Heider A. [2 ]
Alsmadi, Izzat M. [3 ]
机构
[1] Zarqa Univ, Fac Sci & IT, Zarqa, Jordan
[2] King Khalid Univ, Coll Comp Sci, Dept Comp Sci, Abha, Saudi Arabia
[3] Prince Sultan Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11586, Saudi Arabia
关键词
Arabic Web spam; content-based; link-based; Information Retrieval;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For marketing purposes, Some Websites designers and administrators use illegal Search Engine Optimization (SEO) techniques to optimize the ranking of their Web pages and mislead the search engines. Some Arabic Web pages use both content and link features, to increase artificially the rank of their Web pages in the Search Engine Results Pages (SERPs). This study represents an enhancement to previous work in this field. It includes the design and implementation of an online Arabic Web spam detection system, based on algorithms and mathematical foundations, which can detect the Arabic content and link web spam depending on the tree of the spam detection conditions, beside depending on the user's feedback through a custom Web browser. The users can participate in making the decision about any Web page, through their feedbacks, so they judge if the Arabic Web pages in the browser are relevant for their particular queries or not. The proposed system uses the extracted content and link features from Arabic Web pages to determine whether to label each Web page as a spam or as a nonspam. This system also attempts to learn from the user's feedback to enhance automatically its performance. Statistical analysis is adopted in this study to evaluate the proposed system. Statistical Package for the Social Sciences (SPSS) software is used to evaluate this new system which considers the users feedbacks as dependent variables, while Arabic content and links features on the other hand are considered independent variables. The statistical analysis with the SPSS is used to apply a variety of tests, such as the test of the analysis of variance (ANOVA). ANOVA is used to show the relationships between the dependent and independent variables in the dataset, which leads to solving problems and building intelligent decisions and results.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [31] An effective feature selection method for web spam detection
    Asdaghi, Faeze
    Soleimani, Ali
    KNOWLEDGE-BASED SYSTEMS, 2019, 166 : 198 - 206
  • [32] Combining Textual Content and Hyperlinks in Web Spam Detection
    Javier Ortega, F.
    Macdonald, Craig
    Troyano, Jose A.
    Cruz, Fermin L.
    Enriquez, Fernando
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2011, 6716 : 266 - 269
  • [33] Spam detection in online social networks by deep learning
    Ameen, Aso Khaleel
    Kaya, Buket
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [34] Using Machine Learning Algorithms to Detect Content-based Arabic Web Spam
    Wahsheh, Heider
    Abu Doush, Iyad
    Al-Kabi, Mohammed
    Alsmadi, Izzat
    Al-Shawakfa, Emad
    JOURNAL OF INFORMATION ASSURANCE AND SECURITY, 2012, 7 (01): : 14 - 23
  • [35] Advances in spam detection for email spam, web spam, social network spam, and review spam: ML-based and nature-inspired-based techniques
    Akinyelu, Andronicus A.
    JOURNAL OF COMPUTER SECURITY, 2021, 29 (05) : 473 - 529
  • [36] Enhancing Detection of Arabic Social Spam Using Data Augmentation and Machine Learning
    Alkadri, Abdullah M.
    Elkorany, Abeer
    Ahmed, Cherry
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [37] Automated Spam Review Detection Using Hybrid Deep Learning on Arabic Opinions
    Alwayle I.M.
    Al-Onazi B.B.
    Nour M.K.
    Alalayah K.M.
    Alaidarous K.M.
    Ahmed I.A.
    Mehanna A.S.
    Motwakel A.
    Computer Systems Science and Engineering, 2023, 46 (03): : 2947 - 2961
  • [38] An Online Linear Chinese Spam Emails Filtering System
    Qiu, Yongqin
    Xu, Yan
    Wang, Bin
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 190 - 193
  • [39] A real-time framework for opinion spam detection in Arabic social networks
    Ezzat, Cherry A.
    Alkadri, Abdullah M.
    Elkorany, Abeer
    EGYPTIAN INFORMATICS JOURNAL, 2025, 29
  • [40] Adaptive Learning Ant Colony Optimization for Web Spam Detection
    Manaskasemsak, Bundit
    Jiarpakdee, Jirayus
    Rungsawang, Arnon
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, PART VI - ICCSA 2014, 2014, 8584 : 642 - 653