search engine;
authority of pages;
PageRank;
hyperlink analysis;
D O I:
暂无
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
With die enormous growth of Web pages, an increasing number of users are relying on Web search engine. But now it is impossible for spider to retrieval all pages on the Web. Furthermore, because of replicated contents the Spider needn't to retrieve all Web pages. Apparently, visiting important pages firstly can be an efficient method. So evaluating the authority of Web pages has becoming a key problem. Moreover, if we can compute die authority of all pages appropriately their the search engine can return important pages when processing the queries. Then users can obtain satisfying results. Google is one of the best search engines. It can produce much more satisfying search results than many other existing systems. It has a great progress on both precision and recall based on hyperlink analysis. This paper analyzed Google's Algorithm on PageRank in details and presented some disadvantages of this algorithm, for instance, preferring old pages, ignoring special sites and inaccurate judge of hyperlinks pointed out from one page. Furthermore, we describe our improved algorithm. Experiments show that our consideration on evaluating the importance of pages can make an improvement over the original algorithm.
机构:
Department of Social Sciences, The Nottingham Trent University, Nottingham, United KingdomDepartment of Social Sciences, The Nottingham Trent University, Nottingham, United Kingdom
Miller, Hugh
Arnold, Jill
论文数: 0引用数: 0
h-index: 0
机构:
Department of Social Sciences, The Nottingham Trent University, Nottingham, United KingdomDepartment of Social Sciences, The Nottingham Trent University, Nottingham, United Kingdom
Arnold, Jill
Computers and Education,
2000,
34
(3-4):
: 335
-
339