A sampling method based on URL clustering for fast web accessibility evaluation

Cited by: 9
Authors
Zhang, Meng-ni [1]
Wang, Can [1]
Bu, Jia-jun [1]
Yu, Zhi [1]
Zhou, Yu [1]
Chen, Chun [1]
Affiliation
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Page sampling; URL clustering; Web accessibility evaluation;
DOI
10.1631/FITEE.1400377
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
When evaluating the accessibility of a large website, we rely on sampling methods to reduce the cost of evaluation. This may lead to a biased evaluation when the distribution of checkpoint violations in a website is skewed and the selected samples do not represent the entire website well. To improve sampling quality, stratified sampling methods first cluster the web pages in a site and then draw samples from each cluster. In existing stratified sampling methods, however, all the pages in a website need to be analyzed for clustering, which incurs large I/O and computation costs. To address this issue, we propose a novel page sampling method based on URL clustering for web accessibility evaluation, namely URLSamp. Because it uses only URL information for stratified page sampling, URLSamp scales efficiently to large websites. Meanwhile, by exploiting similarities in URL patterns, URLSamp clusters pages by their generating scripts and can thus effectively detect accessibility problems that originate in web page templates. We use a data set of 45 websites to validate our method. Experimental results show that URLSamp is both effective and efficient for web accessibility evaluation.
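The idea in the abstract, grouping pages by URL pattern and then sampling from each group, can be illustrated with a minimal Python sketch. This is not the authors' URLSamp algorithm, whose details are not given in this record. The pattern heuristic (numeric path segments collapsed to `<num>`, query strings reduced to their sorted parameter names), the function names `url_pattern`, `cluster_by_url`, and `stratified_sample`, and the proportional allocation rule are all assumptions made for illustration.

```python
import random
from collections import defaultdict
from urllib.parse import urlsplit, parse_qsl


def url_pattern(url):
    """Reduce a URL to a coarse pattern (hypothetical heuristic, not from the paper)."""
    parts = urlsplit(url)
    # Collapse purely numeric path segments so /news/1 and /news/2 share a pattern.
    segments = ["<num>" if seg.isdigit() else seg
                for seg in parts.path.split("/") if seg]
    # Keep only the query parameter names; values are assumed to vary per page.
    keys = sorted(k for k, _ in parse_qsl(parts.query, keep_blank_values=True))
    return (parts.netloc, "/" + "/".join(segments), tuple(keys))


def cluster_by_url(urls):
    """Group URLs that share a pattern; no page content is fetched."""
    clusters = defaultdict(list)
    for url in urls:
        clusters[url_pattern(url)].append(url)
    return dict(clusters)


def stratified_sample(clusters, budget, seed=0):
    """Draw pages from every cluster, roughly in proportion to cluster size.

    Each cluster contributes at least one page, so the result can exceed
    `budget` when there are many small clusters (an assumed design choice).
    """
    rng = random.Random(seed)
    total = sum(len(pages) for pages in clusters.values())
    sample = []
    for pattern in sorted(clusters):
        pages = clusters[pattern]
        k = max(1, round(budget * len(pages) / total))
        sample.extend(rng.sample(pages, min(k, len(pages))))
    return sample


urls = [
    "http://example.com/news/1",
    "http://example.com/news/2",
    "http://example.com/item.php?id=3",
    "http://example.com/item.php?id=4",
    "http://example.com/about",
]
clusters = cluster_by_url(urls)
sample = stratified_sample(clusters, budget=3)
```

Only the sampled pages would then be passed to an accessibility checker, which is where the cost saving over content-based clustering would come from under these assumptions.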
Pages: 449-456
Number of pages: 8