A Search Engine Based on Query Logs, and Search Log Analysis by Automatic Language Identification

被引:0
|
作者
Oakes, Michael [1 ]
Xu, Yan [1 ]
机构
[1] Univ Sunderland, DGIC, Dept Comp Engn & Technol, Sunderland SR6 0DD, England
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work describes a variation on the traditional Information Retrieval paradigm, where instead of text documents being indexed according to their content, they are indexed according to the search terms previous users have used in finding them. We determine the effectiveness of this approach by indexing a sample of query logs from the European Library, and describe its usefulness for multilingual searching. In our analysis of the search logs, we determine the language of the past queries automatically, and annotate the search logs accordingly. From this information, we derive matrices to show that a) users tend to persist with the same query language throughout a query session, and b) submit queries in the same language as the interface they have selected.
引用
收藏
页码:526 / 533
页数:8
相关论文
共 50 条
  • [41] Adding Synonym Query for Chinese Language to Lucene Search Engine
    Zhao, Xu
    Xu, Wenbo
    Chai, Zhilei
    DCABES 2008 PROCEEDINGS, VOLS I AND II, 2008, : 426 - 432
  • [42] Neural network applications for automatic new topic identification of FAST and Excite search engine transaction logs
    Ozmutlu, Seda
    Ozmutlu, Huseyin C.
    Cosar, Gencer C.
    EXPERT SYSTEMS, 2011, 28 (02) : 101 - 122
  • [43] Using Monte-Carlo simulation for automatic new topic identification of search engine transaction logs
    Ozmutlu, Seda
    Ozmutlu, Huseyin C.
    Buyuk, Buket
    PROCEEDINGS OF THE 2007 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2007, : 2285 - 2293
  • [44] A Monte-Carlo simulation application for automatic new topic identification of search engine transaction logs
    Ozmutlu, Seda
    Ozmutlu, Huseyin C.
    Buyuk, Buket
    SIMULATION MODELLING PRACTICE AND THEORY, 2008, 16 (05) : 519 - 538
  • [45] Neural network applications for automatic new topic identification on excite web search engine data logs
    Özmutlu, HC
    Çavdur, F
    Özmutlu, S
    Spink, A
    ASIST 2004: PROCEEDINGS OF THE 67TH ASIS&T ANNUAL MEETING, VOL 41, 2004: MANAGING AND ENHANCING INFORMATION: CULTURES AND CONFLICTS, 2004, 41 : 310 - 316
  • [46] Search Query Language Identification Using Weak Labeling
    Tambi, Ritiz
    Kale, Ajinkya
    King, Tracy Holloway
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3520 - 3527
  • [47] Research on Query Results Cache Based on Log Analysis in Web Search Engines
    Ma, Hongyuan
    2013 3RD INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, COMMUNICATIONS AND NETWORKS (CECNET), 2013, : 551 - 554
  • [48] Mining search engine query log for evaluating content and structure of a web site
    Hosseini, Mehdi
    Abolhassani, Hassan
    PROCEEDINGS OF THE IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE: WI 2007, 2007, : 235 - 241
  • [49] Ranking Keyword Search Results with Query Logs
    Zhou, Jing
    Yu, Xiaohui
    Liu, Yang
    Yu, Ziqiang
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 770 - 771
  • [50] Segmenting Search Query Logs by Learning to Detect Search Task Boundaries
    Lugo, Luis
    Moreno, Jose G.
    Hubert, Gilles
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2037 - 2040