DeepSearcher: A Chinese deep web classified search engine

被引:0
|
作者
Institute of Intelligent Information Processing and Application, Soochow University, Suzhou 215006, China [1 ]
机构
来源
J. Comput. Inf. Syst. | 2008年 / 1卷 / 111-118期
关键词
Automatic classification - Back end databases - Crawlable pages - Data integration - Deep web - Domain hierarchy;
D O I
暂无
中图分类号
学科分类号
摘要
Web search engines work well in finding crawlable pages, but not well in finding pages dynamically generated by back-end databases. These pages are often referred to as the Deep Web. According to many studies, the Deep Web reside in topic-specific back-end databases, and the size of the Deep Web increases rapidly. Moreover, contents provided by Deep Web are often of high quality and well-structured. Organizing such structured sources into a domain hierarchy will make users conveniently browse to find these valuable resources and this is one of the critical steps toward large-scale integration of heterogeneous Deep Web sources. We design a prototype of Chinese Deep Web classified search engine called DeepSearcher. We also propose a Deep Web crawling strategy and algorithm for Deep Web judgment and classification. Our experimental results indicate that this approach can achieve good results.
引用
收藏
相关论文
共 50 条
  • [1] DeepSearcher: A One-Time Searcher for Deep Web
    Shen, Derong
    Sun, Gaoshang
    Nie, Tiezheng
    Kou, Yue
    HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 273 - 277
  • [2] Deep Web Performance Enhance on Search Engine
    Kumar, Deepak
    Mishra, Rajesh
    2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,
  • [3] A Novel Interdisciplinary Approach to Deep Web Search Engine
    Tin, Pyke
    Zin, Thi Thi
    Hama, Hiromitsu
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (11): : 169 - 176
  • [4] Web searching in Chinese: A study of a search engine in Hong Kong
    Chau, Michael
    Fang, Xiao
    Yang, Christopher C.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (07): : 1044 - 1054
  • [5] An Empirical Analysis of Paid Placement in Chinese Web Search Engine Results
    Long, Haiquan
    Lv, Benfu
    Peng, Geng
    Chen, Jie
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 12253 - 12256
  • [6] A Web-Based Search Engine for Chinese Calligraphic Manuscript Images
    Zhuang, Yi
    Jiang, Nan
    Hu, Haiyang
    ADVANCES IN WEB BASED LEARNING - ICWL 2009, 2009, 5686 : 464 - +
  • [7] A Declarative Query Language Enabled Autonomous Deep Web Search Engine
    Naha, Kallol
    Jamil, Hasan M.
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 305 - 312
  • [8] Web Search Engine Research
    Isfandyari-Moghaddam, Alireza
    ELECTRONIC LIBRARY, 2013, 31 (03): : 403 - 404
  • [9] Web Search Engine Research
    Cazan, Constantin
    INFORMATION-WISSENSCHAFT UND PRAXIS, 2012, 63 (06): : 394 - 395
  • [10] Web Search Engine Research
    MacFarlane, Andrew
    JOURNAL OF DOCUMENTATION, 2013, 69 (04) : 594 - 596