Using high performance systems to build collections for a digital library

被引:0
|
作者
Bergmark, D [1 ]
机构
[1] Cornell Univ, Digital Lab Res Grp, Ithaca, NY 14853 USA
来源
2002 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDINGS OF THE WORKSHOPS | 2002年
关键词
D O I
10.1109/ICPPW.2002.1039762
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nothing is more distributed than the Web, with its content spread across thousands of servers. High performance hardware and software is essential for an effective down-load, analysis, and organization of this content. We describe our experience with a highly parallel Web crawling system (Mercator) to construct - automatically - collections of scientific resources for the National Science Digital Library.
引用
收藏
页码:431 / 438
页数:8
相关论文
共 50 条