Studying the XML Web: Gathering statistics from an XML sample

被引:27
|
作者
Barbosa, D
Mignet, L
Veltri, P
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G5, Canada
[2] IBM India Res Lab, New Delhi 110016, India
[3] Magna Graecia Univ Catanzaro, Dept Expt & Clin Med, I-88100 Catanzaro, Italy
[4] INRIA, Paris, France
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2005年 / 8卷 / 04期
关键词
World Wide Web; XML; XML web; XML Documents; XML processing tools;
D O I
10.1007/s11280-005-1544-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
XML has emerged as the language for exchanging data on the web and has attracted considerable interest both in industry and in academia. Nevertheless, to date, little is known about the XML documents published on the web. This paper presents a comprehensive analysis of a sample of about 200,000 XML documents on the web, and is the first study of its kind. We study the distribution of XML documents across the web in several ways; moreover, we provided a detailed characterization of the structure of real XML documents. Our results provide valuable input to the design of algorithms, tools and systems that use XML in one form or another.
引用
收藏
页码:413 / 438
页数:26
相关论文
共 50 条
  • [41] Testing web services by XML perturbation
    Xu, Wuzhi
    Offutt, Jeff
    Luo, Juan
    16TH IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 2005, : 257 - 266
  • [42] On the improvement of XML web services security
    Abuelyaman, E
    Brammeier, B
    SAM '05: Proceedings of the 2005 International Conference on Security and Management, 2005, : 253 - 259
  • [43] An application of XML in PDMS based on Web
    Wang, Baobao
    Wavelet Active Media Technology and Information Processing, Vol 1 and 2, 2006, : 481 - 485
  • [44] Possible attacks on XML Web Services
    Moradian, Esmiralda
    Hakansson, Anne
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (1B): : 154 - 170
  • [45] XML for the World Wide Web.
    Ziener, C
    LIBRARY JOURNAL, 2001, 126 (01) : 144 - 144
  • [46] Handling interlinked XML instances on the Web
    Behrends, E
    Fritzen, O
    May, W
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 792 - 810
  • [47] XML Web Services: The global computer?
    Gordon, AD
    FOUNDATIONS OF INFORMATION TECHNOLOGY IN THE ERA OF NETWORK AND MOBILE COMPUTING, 2002, 96 : 355 - 355
  • [48] XML Web Service技术初探
    程仁贵
    南平师专学报, 2004, (02) : 9 - 11+15
  • [49] Web services based on PROLOG and XML
    Heumesser, BD
    Ludwig, A
    Seipel, D
    APPLICATIONS OF DECLARATIVE PROGRAMMING AND KNOWLEDGE MANAGEMENT, 2005, 3392 : 245 - 257
  • [50] XML and web services go mobile
    Sharp, Kevin R.
    Supply Chain Syst. Mag., 3 (38-39):