Searching for explanatory Web pages using automatic query expansion

Authors
Tauchi, Manabu [1]
Ward, Nigel [1]
Affiliation
[1] University of Tokyo, School of Engineering, Tokyo, Japan
Keywords
search engines; reranking; pseudo-feedback; local relevance density; terminology; Japanese
DOI
10.1111/j.1467-8640.2007.00291.x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
When one tries to use the Web as a dictionary or encyclopedia, entering some single term into a search engine, the highly ranked pages in the result can include irrelevant or useless sites. The problem is that single-term queries, if taken literally, underspecify the type of page the user wants. For such problems automatic query expansion, also known as pseudo-feedback, is often effective. In this method the top n documents returned by an initial retrieval are used to provide terms for a second retrieval. This paper contributes, first, new normalization techniques for query expansion, and second, a new way of computing the similarity between an expanded query and a document, the "local relevance density" metric, which complements the standard vector product metric. Both of these techniques are shown to be useful for single-term queries, in Japanese, in experiments done over the World Wide Web in early 2001.
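The pseudo-feedback loop described in the abstract (take the top n documents from an initial retrieval, draw expansion terms from them, then score documents against the expanded query) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' method: the function names, the tiny stopword list, the raw term-frequency weighting, and the toy documents are all assumptions, and neither the paper's normalization techniques nor its "local relevance density" metric is reproduced here, since the abstract does not define them. The scoring function is a plain term-frequency dot product standing in for the "standard vector product metric".

```python
from collections import Counter

# Hypothetical stopword list, for illustration only.
STOPWORDS = {"a", "and", "in", "is", "of", "the"}


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; Japanese text would need a real segmenter)."""
    return text.lower().split()


def expand_query(query_terms, ranked_docs, n=2, k=3):
    """Add the k most frequent new terms found in the top-n documents of the initial ranking."""
    counts = Counter()
    for doc in ranked_docs[:n]:
        counts.update(
            t for t in tokenize(doc)
            if t not in query_terms and t not in STOPWORDS
        )
    # Sort by descending count, then alphabetically, so ties resolve deterministically.
    best = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:k]
    return list(query_terms) + [term for term, _ in best]


def vector_product(query_terms, doc):
    """Dot product of a binary query vector with the document's term-frequency vector."""
    tf = Counter(tokenize(doc))
    return sum(tf[t] for t in query_terms)


def rerank(query_terms, docs):
    """Order documents by descending score; ties keep their original order."""
    return sorted(docs, key=lambda d: vector_product(query_terms, d), reverse=True)


# Toy initial retrieval for the single-term query "entropy" (made-up data).
docs = [
    "entropy is a measure of disorder definition of entropy",
    "entropy definition a measure of uncertainty",
    "buy entropy brand shoes online",
    "a measure and definition of disorder in physics",
]
expanded = expand_query(["entropy"], docs, n=2, k=3)
reranked = rerank(expanded, docs)
```

In this toy run the expansion terms come from the two explanatory pages, so the shopping page, which matches only the literal query term, drops to the bottom of the reranked list.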
Pages: 3-14
Page count: 12