Studying the XML Web: Gathering statistics from an XML sample

Cited by: 27
Authors
Barbosa, D
Mignet, L
Veltri, P
Affiliations
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G5, Canada
[2] IBM India Res Lab, New Delhi 110016, India
[3] Magna Graecia Univ Catanzaro, Dept Expt & Clin Med, I-88100 Catanzaro, Italy
[4] INRIA, Paris, France
Source
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2005 / Vol. 8 / Issue 04
Keywords
World Wide Web; XML; XML web; XML Documents; XML processing tools;
DOI
10.1007/s11280-005-1544-y
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
XML has emerged as the language for exchanging data on the web and has attracted considerable interest both in industry and in academia. Nevertheless, to date, little is known about the XML documents published on the web. This paper presents a comprehensive analysis of a sample of about 200,000 XML documents on the web, and is the first study of its kind. We study the distribution of XML documents across the web in several ways; moreover, we provide a detailed characterization of the structure of real XML documents. Our results provide valuable input to the design of algorithms, tools and systems that use XML in one form or another.
Pages: 413-438
Page count: 26