An Ontology-Based Topical Crawling Algorithm for Accessing Deep Web Content

Cited by: 3
Authors
Arya, K. V. [1 ]
Vadlamudi, Baby Ramya [1 ]
Affiliation
[1] IIITM, ABV, Gwalior 474010, India
Keywords
Focused crawler; Domain ontology; Deep web; Form processing
DOI
10.1109/ICCCT.2012.10
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Given the volume of the Web and the frequency with which its content changes, the coverage and quality of the pages retrieved by modern search engines are relatively low. These engines crawl only hypertext links and ignore search forms, which are the entry points to deep web content, where two-thirds of the information resides. This paper presents an algorithm that enables topical crawlers to access hidden web content by using a domain ontology to determine each form's relevance to the domain. The domain considered in this work is scientific research publications. Experimental results show that the proposed approach outperforms keyword-based crawlers in terms of both relevancy and completeness.
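The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of scoring a search form against a domain ontology. The concept names, weights, synonym sets, the `form_relevance` function, and the 0.5 threshold are all hypothetical assumptions for illustration and are not taken from the paper.

```python
import re
from html.parser import HTMLParser

# Hypothetical mini-ontology for the scientific-publications domain:
# concept -> (weight, single-word labels). Assumed values, not from the paper.
ONTOLOGY = {
    "author": (1.0, {"author", "authors", "writer"}),
    "title": (1.0, {"title"}),
    "publication": (0.8, {"journal", "conference", "proceedings", "venue"}),
    "year": (0.5, {"year", "date"}),
    "keyword": (0.5, {"keyword", "keywords", "subject"}),
}


class FormTextExtractor(HTMLParser):
    """Collects lowercase word tokens from field attributes and text inside <form>."""

    def __init__(self):
        super().__init__()
        self.depth = 0      # > 0 while the parser is inside a <form> element
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        if tag == "form":
            self.depth += 1
        elif self.depth and tag in ("input", "select", "textarea"):
            for key, value in attrs:
                if key in ("name", "id", "placeholder") and value:
                    self._add(value)

    def handle_endtag(self, tag):
        if tag == "form" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self._add(data)

    def _add(self, text):
        self.tokens.extend(re.findall(r"[a-z]+", text.lower()))


def form_relevance(html, ontology=ONTOLOGY):
    """Return the weighted fraction of ontology concepts mentioned in the page's forms."""
    parser = FormTextExtractor()
    parser.feed(html)
    tokens = set(parser.tokens)
    total = sum(weight for weight, _ in ontology.values())
    matched = sum(weight for weight, labels in ontology.values() if labels & tokens)
    return matched / total


def is_relevant(html, threshold=0.5):
    """Assumed decision rule: keep a form whose score reaches the threshold."""
    return form_relevance(html) >= threshold


SEARCH_FORM = (
    '<form action="/search"><label>Author</label><input name="author_name">'
    '<label>Title</label><input name="title"><select name="year"></select>'
    '<input type="submit" value="Search"></form>'
)
LOGIN_FORM = '<form><input name="username"><input name="password" type="password"></form>'

print(is_relevant(SEARCH_FORM), is_relevant(LOGIN_FORM))
```

In this sketch a publication search form matches the author, title, and year concepts and passes the threshold, while a login form matches none. A keyword-based crawler would instead compare raw strings, whereas the ontology groups synonyms under weighted concepts.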
Pages: 1 / 6
Number of pages: 6