OLAWSDS: An Online Arabic Web Spam Detection System

被引:0
|
作者
Al-Kabi, Mohammed N. [1 ]
Wahsheh, Heider A. [2 ]
Alsmadi, Izzat M. [3 ]
机构
[1] Zarqa Univ, Fac Sci & IT, Zarqa, Jordan
[2] King Khalid Univ, Coll Comp Sci, Dept Comp Sci, Abha, Saudi Arabia
[3] Prince Sultan Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11586, Saudi Arabia
关键词
Arabic Web spam; content-based; link-based; Information Retrieval;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
For marketing purposes, Some Websites designers and administrators use illegal Search Engine Optimization (SEO) techniques to optimize the ranking of their Web pages and mislead the search engines. Some Arabic Web pages use both content and link features, to increase artificially the rank of their Web pages in the Search Engine Results Pages (SERPs). This study represents an enhancement to previous work in this field. It includes the design and implementation of an online Arabic Web spam detection system, based on algorithms and mathematical foundations, which can detect the Arabic content and link web spam depending on the tree of the spam detection conditions, beside depending on the user's feedback through a custom Web browser. The users can participate in making the decision about any Web page, through their feedbacks, so they judge if the Arabic Web pages in the browser are relevant for their particular queries or not. The proposed system uses the extracted content and link features from Arabic Web pages to determine whether to label each Web page as a spam or as a nonspam. This system also attempts to learn from the user's feedback to enhance automatically its performance. Statistical analysis is adopted in this study to evaluate the proposed system. Statistical Package for the Social Sciences (SPSS) software is used to evaluate this new system which considers the users feedbacks as dependent variables, while Arabic content and links features on the other hand are considered independent variables. The statistical analysis with the SPSS is used to apply a variety of tests, such as the test of the analysis of variance (ANOVA). ANOVA is used to show the relationships between the dependent and independent variables in the dataset, which leads to solving problems and building intelligent decisions and results.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 50 条
  • [21] Towards Building A Personalized Online Web Spam Detector in Intelligent Web Browsers
    Dong, Cailing
    Zhou, Bin
    Zhou, Lina
    2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2013, : 585 - 590
  • [22] Online Spam Review Detection: A Survey of Literature
    Li He
    Xianzhi Wang
    Hongxu Chen
    Guandong Xu
    Human-Centric Intelligent Systems, 2022, 2 (1-2): : 14 - 30
  • [23] Improving Spam Detection in Online Social Networks
    Gupta, Arushi
    Kaushal, Rishabh
    2015 INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2015,
  • [24] A SURVEY ON ONLINE REVIEW SPAM DETECTION TECHNIQUES
    Rajamohana, S. P.
    Umamaheswari, K.
    Dharani, M.
    Vedackshya, R.
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN GREEN ENERGY AND HEALTHCARE TECHNOLOGIES (IGEHT), 2017,
  • [25] A Behavioral Spam Detection System
    Ibrahim, Asma
    Osman, Izzeldin Mohamed
    FUTURE COMPUTER, COMMUNICATION, CONTROL AND AUTOMATION, 2011, 119 : 77 - 81
  • [26] Detecting Web Spam using a Recovering Web Links System
    Araujo, Lourdes
    Martinez-Romo, Juan
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (42): : 39 - 46
  • [27] An Autonomous Online Malicious Spam Email Detection System Using Extended RBF Network
    Ali, Siti-Hajar-Aminah
    Ozawa, Seiichi
    Nakazato, Junji
    Ban, Tao
    Shimamura, Jumpei
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [28] A Novel Set of Contextual Features for Web Spam Detection
    Asdaghi, Faeze
    Soleimani, Ali
    Zahedi, Morteza
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2020, 11 (01): : 321 - 339
  • [29] A Reputation Based Detection Technique to Cloaked Web Spam
    Sunil, A. Naga Venkata
    Sardana, Anjali
    2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 : 566 - 572
  • [30] Efficient Spam Detection across Online Social Networks
    Xu, Hailu
    Sun, Weiqing
    Javaid, Ahmad
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2016, : 225 - 230