DeepSearcher: A Chinese deep web classified search engine

被引:0
|
作者
Institute of Intelligent Information Processing and Application, Soochow University, Suzhou 215006, China [1 ]
机构
来源
J. Comput. Inf. Syst. | 2008年 / 1卷 / 111-118期
关键词
Automatic classification - Back end databases - Crawlable pages - Data integration - Deep web - Domain hierarchy;
D O I
暂无
中图分类号
学科分类号
摘要
Web search engines work well in finding crawlable pages, but not well in finding pages dynamically generated by back-end databases. These pages are often referred to as the Deep Web. According to many studies, the Deep Web reside in topic-specific back-end databases, and the size of the Deep Web increases rapidly. Moreover, contents provided by Deep Web are often of high quality and well-structured. Organizing such structured sources into a domain hierarchy will make users conveniently browse to find these valuable resources and this is one of the critical steps toward large-scale integration of heterogeneous Deep Web sources. We design a prototype of Chinese Deep Web classified search engine called DeepSearcher. We also propose a Deep Web crawling strategy and algorithm for Deep Web judgment and classification. Our experimental results indicate that this approach can achieve good results.
引用
收藏
相关论文
共 50 条
  • [21] Web search engine based on DNS
    Wang Liang
    Guo Yi-Ping
    Fang Ming
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (02) : 466 - 478
  • [22] ExpertRec: A Collaborative Web Search Engine
    Sun, Jingyu
    Chen, Junjie
    Yu, Xueli
    Zhong, Ning
    WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 385 - +
  • [23] MediCrawl - A Web Search Engine For Diseases
    Trivedi, Devharsh
    Gopalakrishnan, Vaishnavi
    2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 148 - 157
  • [24] IMPLEMENTATION OF A SIMPLE WEB SEARCH ENGINE
    Saveluc, Diana-Alexandra
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2015, 2015, : 163 - 174
  • [25] Web search engine as a bee hive
    Navrat, Pavol
    Kovacik, Martin
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 694 - +
  • [26] A Framework of Web Image Search Engine
    Xu, Weiguang
    Zhang, Yafei
    Lu, Jianjiang
    Li, Ran
    Xie, Zhenghui
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 522 - 525
  • [27] Web search engine multimedia functionality
    Tjondronegoro, Dian
    Spink, Amanda
    INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (01) : 340 - 357
  • [28] Research of segmentation of Chinese texts in Chinese search engine
    Zhou, LX
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 2627 - 2631
  • [29] Falcons Concept Search: A Practical Search Engine for Web Ontologies
    Qu, Yuzhong
    Cheng, Gong
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2011, 41 (04): : 810 - 816
  • [30] Optimization of Web Search Engine and Its Application to Web Mining
    CHEN Hao1
    2. Software School
    3. Department of Computer Science and Technology
    WuhanUniversityJournalofNaturalSciences, 2009, 14 (02) : 115 - 118