A NEW APPROACH TOWARDS VERTICAL SEARCH ENGINES Intelligent Focused Crawling and Multilingual Semantic Techniques

被引:0
|
作者
Peters, Sybille [1 ]
Rueckemann, Claus-Peter [1 ]
Sander-Beuermann, Wolfgang [1 ]
机构
[1] LUH, Reg Rechenzentrum Niedersachsen RRZN, Hannover, Germany
关键词
Focused crawling; Search engine; Vertical search engine; Metadata; Educational research; Link analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Search engines typically consist of a crawler which traverses the web retrieving documents and a search front-end which provides the user interface to the acquired information. Focused crawlers refine the crawler by intelligently directing it to predefined topic areas. The evolution of search engines today is expedited by supplying more search capabilities such as a search for metadata as well as search within the content text. Semantic web standards have supplied methods for augmenting webpages with metadata. Machine learning techniques are used where necessary to gather more metadata from unstructured webpages. This paper analyzes the effectiveness of techniques for vertical search engines with respect to focused crawling and metadata integration exemplarily in the field of "educational research". A search engine for these purposes implemented within the EERQI project is described and tested. The enhancement of focused crawling with the use of link analysis and anchor text classification is implemented and verified. A new heuristic score calculation formula has been developed for focusing the crawler. Full-texts and metadata from various multilingual sources are collected and combined into a common format.
引用
收藏
页码:181 / 186
页数:6
相关论文
共 50 条
  • [1] Focused crawling using latent semantic indexing - An application for vertical search engines
    Almpanidis, G
    Kotropoulos, C
    Pitas, I
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2005, 3652 : 402 - 413
  • [2] Combining text and link analysis for focused crawling - An application for vertical search engines
    Almpanidis, G.
    Kotropoulos, C.
    Pitas, I.
    INFORMATION SYSTEMS, 2007, 32 (06) : 886 - 908
  • [3] A Novel Web Crawling Method For Vertical Search Engines
    Pavani, Kolli
    Sajeev, G. P.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1488 - 1493
  • [4] A Web-Based Semantic Focused Crawling Approach
    Liu, Yongjian
    Ma, Deng
    Sun, Jianpeng
    2013 INTERNATIONAL CONFERENCE ON CYBER SCIENCE AND ENGINEERING (CYBERSE 2013), 2013, : 287 - 293
  • [5] Search engines crawling process optimization: a webserver approach
    Zineddine, Mhamed
    INTERNET RESEARCH, 2016, 26 (01) : 311 - 331
  • [6] Bayes Topic Prediction Model for Focused Crawling of Vertical Search Engine
    Zhang, Weihong
    Chen, Yong
    2014 IEEE COMPUTING, COMMUNICATIONS AND IT APPLICATIONS CONFERENCE (COMCOMAP), 2014, : 294 - 299
  • [7] Towards open decision support systems based on semantic focused crawling
    Jung, Jason J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 3914 - 3922
  • [8] Towards a new approach to query search engines: the Search Tree visual language
    Paolino, Luca
    Sebillo, Monica
    Tortora, Genoveffa
    Vitiello, Giuliana
    SOFTWARE-PRACTICE & EXPERIENCE, 2010, 40 (08): : 735 - 750
  • [9] A Semantic Search Approach to Task-Completion Engines
    Garigliotti, Dario
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1457 - 1457
  • [10] Classifying and ranking: The first step towards mining inside vertical search engines
    Guo, Hang
    Zhang, Jun
    Zhou, Lizhu
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, 4653 : 223 - +