An Ontology-Based Topical Crawling Algorithm for Accessing Deep Web Content

Cited by: 3
Authors
Arya, K. V. [1]
Vadlamudi, Baby Ramya [1]
Affiliation
[1] IIITM, ABV, Gwalior 474010, India
Source
2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT) | 2012
Keywords
Focused crawler; Domain ontology; Deep web; Form processing
DOI
10.1109/ICCCT.2012.10
Chinese Library Classification
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
Given the volume of the Web and how frequently its content changes, the coverage and quality of the pages retrieved by modern search engines are relatively low. These engines crawl only hypertext links and ignore search forms, which are the entry points to deep web content, where two-thirds of the information resides. This paper presents an algorithm that enables topical crawlers to access hidden web content by using a domain ontology to determine a form's relevance to the domain. The domain considered in this work is scientific research publications. Experimental results show that the proposed approach outperforms keyword-based crawlers in terms of both relevancy and completeness.
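The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of scoring a search form against a domain ontology. Everything in it is an assumption made for illustration: the toy `ONTOLOGY` for the research publications domain, its weights and synonyms, the `FormTermExtractor` helper, the weighted-overlap score, and the 0.5 threshold are hypothetical and are not taken from the paper.

```python
import re
from html.parser import HTMLParser

# Hypothetical toy ontology for the research publications domain:
# concept -> (weight, synonyms). Not taken from the paper.
ONTOLOGY = {
    "author": (1.0, {"author", "writer", "creator"}),
    "title": (1.0, {"title"}),
    "publication": (0.8, {"publication", "journal", "conference", "proceedings"}),
    "year": (0.5, {"year", "date"}),
    "keyword": (0.5, {"keyword", "keywords", "topic"}),
}


class FormTermExtractor(HTMLParser):
    """Collects lowercase word tokens from visible text and from
    field attributes (name, id, placeholder) inside <form> elements."""

    def __init__(self):
        super().__init__()
        self.depth = 0      # nesting depth of open <form> tags
        self.terms = set()

    def _add(self, text):
        self.terms.update(re.findall(r"[a-z]+", text.lower()))

    def handle_starttag(self, tag, attrs):
        if tag == "form":
            self.depth += 1
        elif self.depth and tag in ("input", "select", "textarea"):
            for key, value in attrs:
                if key in ("name", "id", "placeholder") and value:
                    self._add(value)

    def handle_endtag(self, tag):
        if tag == "form" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self._add(data)


def form_relevance(html, ontology=ONTOLOGY):
    """Return the weighted share of ontology concepts found in the page's forms."""
    parser = FormTermExtractor()
    parser.feed(html)
    total = sum(weight for weight, _ in ontology.values())
    matched = sum(
        weight for weight, synonyms in ontology.values() if synonyms & parser.terms
    )
    return matched / total


def is_relevant(html, threshold=0.5):
    """Assumed decision rule: a form is on-topic if its score reaches the threshold."""
    return form_relevance(html) >= threshold
```

Under these assumptions, a form with fields named `author`, `title`, and `pub_year` would be kept by `is_relevant`, while a login form with only `username` and `password` fields would score zero and be skipped. A keyword-based crawler would match raw strings instead; grouping synonyms under weighted concepts is one plausible way an ontology could improve on that.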
Pages: 1-6
Page count: 6