Searching for explanatory Web pages using automatic query expansion

被引:0
|
作者
Tauchi, Manabu [1 ]
Ward, Nigel [1 ]
机构
[1] Univ Tokyo, Sch Engn, Tokyo, Japan
关键词
search engines; reranking; pseudo-feedback; local relevance density; terminology; Japanese;
D O I
10.1111/j.1467-8640.2007.00291.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When one tries to use the Web as a dictionary or encyclopedia, entering some single term into a search engine, the highly ranked pages in the result can include irrelevant or useless sites. The problem is that single-term queries, if taken literally, underspecify the type of page the user wants. For such problems automatic query expansion, also known as pseudo-feedback, is often effective. In this method the top n documents returned by an initial retrieval are used to provide terms for a second retrieval. This paper contributes, first, new normalization techniques for query expansion, and second, a new way of computing the similarity between an expanded query and a document, the "local relevance density" metric, which complements the standard vector product metric. Both of these techniques are shown to be useful for single-term queries, in Japanese, in experiments done over the World Wide Web in early 2001.
引用
收藏
页码:3 / 14
页数:12
相关论文
共 50 条
  • [41] A New Approach for Automatic Query Expansion
    Hmeidi, Ismail
    Al-Badarneh, Amer
    Al-Qtaish, Ahmad A.
    BUSINESS TRANSFORMATION THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: AN ACADEMIC PERSPECTIVE, VOLS 3 AND 4, 2010, : 1975 - 1989
  • [42] Automatic metadata generation for Web pages using a text mining approach
    Yang, HC
    Lee, CH
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 186 - 194
  • [43] Automatic Identification of Web Query Interfaces
    Marin-Castro, Heidy M.
    Sosa-Sosa, Victor J.
    Lopez-Arevalo, Ivan
    ADVANCES IN SOFT COMPUTING, PT II, 2011, 7095 : 297 - 306
  • [44] Supporting web query expansion efficiently using multi-granularity indexing and query processing
    Li, WS
    Agrawal, D
    DATA & KNOWLEDGE ENGINEERING, 2000, 35 (03) : 239 - 257
  • [45] SICS at CLEF 2002:: Automatic query expansion using random indexing
    Sahlgren, M
    Karlgren, J
    Cöster, R
    Järvinen, T
    ADVANCES IN CROSS-LANGUAGE INFORMATION RETRIEVAL, 2003, 2785 : 311 - 320
  • [46] Searching in MEDLINE: Query expansion and manual indexing evaluation
    Abdou, Samir
    Savoy, Jacques
    INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (02) : 781 - 789
  • [47] Query expansion with Naive Bayes for searching distributed collections
    Yang, H
    Zhang, MJ
    INTELLIGENT SYSTEMS, 2002, : 17 - 23
  • [48] The automatic identification of the emotion status of web pages
    John, David
    Boucouvalas, Anthony C.
    EUROMEDIA '2008, 2008, : 18 - +
  • [49] Automatic template detection for structured web pages
    Lo, Lawrence
    Ng, Vincent To-Yee
    Ng, Patrick
    Chan, Stephen C. F.
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 708 - 713
  • [50] Automatic control of simple language in web pages
    Jenge, Constantin
    Hartrumpf, Sven
    Helbig, Hermann
    Osswald, Rainer
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS, 2006, 4061 : 207 - 214