A sampling method based on URL clustering for fast web accessibility evaluation

被引:9
|
作者
Zhang, Meng-ni [1 ]
Wang, Can [1 ]
Bu, Jia-jun [1 ]
Yu, Zhi [1 ]
Zhou, Yu [1 ]
Chen, Chun [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Page sampling; URL clustering; Web accessibility evaluation;
D O I
10.1631/FITEE.1400377
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When evaluating the accessibility of a large website, we rely on sampling methods to reduce the cost of evaluation. This may lead to a biased evaluation when the distribution of checkpoint violations in a website is skewed and the selected samples do not provide a good representation of the entire website. To improve sampling quality, stratified sampling methods first cluster web pages in a site and then draw samples from each cluster. In existing stratified sampling methods, however, all the pages in a website need to be analyzed for clustering, causing huge I/O and computation costs. To address this issue, we propose a novel page sampling method based on URL clustering for web accessibility evaluation, namely URLSamp. Using only the URL information for stratified page sampling, URLSamp can efficiently scale to large websites. Meanwhile, by exploiting similarities in URL patterns, URLSamp cluster pages by their generating scripts and can thus effectively detect accessibility problems from web page templates. We use a data set of 45 web sites to validate our method. Experimental results show that our URLSamp method is both effective and efficient for web accessibility evaluation.
引用
收藏
页码:449 / 456
页数:8
相关论文
共 50 条
  • [31] Web accessibility evaluation via XSLT
    Centeno, Vicente Luque
    Kloos, Carlos Delgado
    M Blazquez del Toro, Jose
    Gaedke, Martin
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007 WORKSHOPS, 2007, 4832 : 459 - +
  • [32] Evaluation of Web Accessibility on the Maps Domain
    Medina, Jonathas Leontino
    Cagnin, Maria Istela
    Paiva, Debora Maria Barroso
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 157 - 162
  • [33] Collaborative Web Accessibility Evaluation: An EARL-Based Workflow Approach
    Lopez-Gil, Juan-Miguel
    Diaz, Oscar
    Iturria, Mikel
    WEB ENGINEERING, ICWE 2023, 2023, 13893 : 206 - 220
  • [34] Building a web-snippet clustering system based on a mixed clustering method
    Chen, Lin-Chih
    ONLINE INFORMATION REVIEW, 2011, 35 (04) : 611 - 635
  • [35] A Fast Density-Grid Based Clustering Method
    Brown, Daniel
    Japa, Arialdis
    Shi, Yong
    2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 48 - 54
  • [36] A new fast prototype selection method based on clustering
    J. Arturo Olvera-López
    J. Ariel Carrasco-Ochoa
    J. Francisco Martínez-Trinidad
    Pattern Analysis and Applications, 2010, 13 : 131 - 141
  • [37] A KPI Anomaly Detection Method Based on Fast Clustering
    Wu, Yun
    Shi, Yu
    Yang, Jieming
    Bao, Lishan
    LI, Chunzhe
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2022, E105B (11) : 1309 - 1317
  • [38] CLUSTERING METHOD FOR FAST CONTENT BASED IMAGE RETRIEVAL
    Kinoshenko, Dmitry
    Mashtalir, Vladimir
    Yegorova, Elena
    COMPUTER VISION AND GRAPHICS (ICCVG 2004), 2006, 32 : 946 - 952
  • [39] A new fast prototype selection method based on clustering
    Arturo Olvera-Lopez, J.
    Ariel Carrasco-Ochoa, J.
    Francisco Martinez-Trinidad, J.
    PATTERN ANALYSIS AND APPLICATIONS, 2010, 13 (02) : 131 - 141
  • [40] DIDES: a fast and effective sampling for clustering algorithm
    Frédéric Ros
    Serge Guillaume
    Knowledge and Information Systems, 2017, 50 : 543 - 568