An Ontology-Based Topical Crawling Algorithm for Accessing Deep Web Content

被引:3
|
作者
Arya, K. V. [1 ]
Vadlamudi, Baby Ramya [1 ]
机构
[1] IIITM, ABV, Gwalior 474010, India
来源
2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT) | 2012年
关键词
Focused crawler; Domain ontology; Deep web; Form processing;
D O I
10.1109/ICCCT.2012.10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Due to the large volume of the Web information and relatively high speed of information update, the coverage and quality of the retrieved pages by modern search engines is comparatively small. Given the volume of the Web and its frequency of content change, the coverage and quality of pages retrieved by modern search engines is relatively small since they crawl only hypertext links ignoring the search forms which are the entry points for accessing deep web content where two-thirds of information is resides. In this paper an algorithm has been designed to enable topical crawlers to access hidden web content by using domain based ontology to determine the forms' relevance to the domain. In this work scientific research publications domain has been considered. Experimental results show that proposed approach is better as compared to keyword based crawlers in terms of both relevancy and completeness.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [21] Genetic algorithm for evaluation metrics in topical web crawling
    Peng, T.
    Zuo, W. L.
    Lin, Y. L.
    COMPUTATIONAL METHODS, PTS 1 AND 2, 2006, : 1203 - +
  • [22] Ontology based web crawling - A novel approach
    Ganesh, S
    ADVANCES IN WEB INTELLIGENCE, PROCEEDINGS, 2005, 3528 : 140 - 149
  • [23] Ontology-Based Deep Web Data Interface Schemas Integration Method
    Wang Rui
    Wang Nianbin
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 182 - 185
  • [24] An algorithm of deep web crawler's crawling
    Xiang Peisu
    Tian Ke
    Huang Qinzhen
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1259 - +
  • [25] Ontology-based Web navigation assistant
    Jung, H
    Yang, JY
    Choi, J
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 443 - 448
  • [26] Ontology-based web knowledge management
    Wang, YM
    Yang, ZH
    Kong, PHH
    Gay, RKL
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1859 - 1863
  • [27] An Ontology-Based Crawler for the Semantic Web
    Van de Maele, Felix
    Spyns, Peter
    Meersman, Robert
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2008 WORKSHOPS, 2008, 5333 : 1056 - +
  • [28] Ontology-Based Web Information Extraction
    Mo, Qian
    Chen, Yi-hong
    COMMUNICATIONS AND INFORMATION PROCESSING, PT 1, 2012, 288 : 118 - 126
  • [29] Ontology-Based Administration of Web Directories
    Horvat, Marko
    Gledec, Gordan
    Bogunovic, Nikola
    TRANSACTIONS ON COMPUTATIONAL COLLECTIVE INTELLIGENCE I, 2010, 6220 : 101 - 120
  • [30] Ontology-Based Web Application Testing
    Paydar, Samad
    Kahani, Mohsen
    NOVEL ALGORITHMS AND TECHNIQUES IN TELECOMMUNICATIONS AND NETWORKING, 2010, : 23 - 27