Identification of ambiguous queries in web search

被引:37
|
作者
Song, Ruihua [1 ,2 ]
Luo, Zhenxiao [3 ]
Nie, Jian-Yun [4 ]
Yu, Yong [2 ]
Hon, Hsiao-Wuen [1 ]
机构
[1] Microsoft Res Asia, Beijing 100190, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[3] Fudan Univ, Shanghai 200433, Peoples R China
[4] Univ Montreal, Montreal, PQ H3C 3J7, Canada
关键词
Ambiguous query; Query classification; Broad topics; Query taxonomy;
D O I
10.1016/j.ipm.2008.09.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is widely believed that many queries submitted to search engines are inherently ambiguous (e.g., java and apple). However, few studies have tried to classify queries based on ambiguity and to answer "what the proportion of ambiguous queries is". This paper deals with these issues. First, we clarify the definition of ambiguous queries by constructing the taxonomy of queries from being ambiguous to specific. Second, we ask human annotators to manually classify queries. From manually labeled results, we observe that query ambiguity is to some extent predictable. Third, we propose a supervised learning approach to automatically identify ambiguous queries. Experimental results show that we can correctly identify 87% of labeled queries with the approach. Finally, by using our approach, we estimate that about 16% of queries in a real search log are ambiguous. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:216 / 229
页数:14
相关论文
共 50 条
  • [21] A vector model for routing queries in web search engines
    Oyarzun, Mauricio S.
    Gonzalez, Senen
    Mendoza, Marcelo
    Ferrarotti, Flavio
    Chacon, Max
    Marin, Mauricio
    ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 457 - 464
  • [22] Classifying Search Queries Using the Web as a Source of Knowledge
    Gabrilovich, Evgeniy
    Broder, Andrei
    Fontoura, Marcus
    Joshi, Amruta
    Josifovski, Vanja
    Riedel, Lance
    Zhang, Tong
    ACM TRANSACTIONS ON THE WEB, 2009, 3 (02)
  • [23] Temporal Dynamics of User Interests in Web Search Queries
    Cayci, Aysegul
    Sumengen, Selcuk
    Turkay, Cagatay
    Balcisoy, Selim
    Saygin, Yucel
    2009 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS: WAINA, VOLS 1 AND 2, 2009, : 762 - 767
  • [24] Translating Web Search Queries into Natural Language Questions
    Kumar, Adarsh
    Dandapat, Sandipan
    Chordia, Sushil
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 944 - 947
  • [25] Semantic Web search based on ontological conjunctive queries
    Fazzinga, Bettina
    Gianforme, Giorgio
    Gottlob, Georg
    Lukasiewicz, Thomas
    JOURNAL OF WEB SEMANTICS, 2011, 9 (04): : 453 - 473
  • [26] Reducing hardware hit by queries in web search engines
    Mendoza, Marcelo
    Marin, Mauricio
    Gil-Costa, Veronica
    Ferrarotti, Flavio
    INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (06) : 1031 - 1052
  • [27] Forecasting Youth Unemployment in Korea with Web Search Queries
    Kwon, Chi-Myung
    Jung, Jae Un
    Internet of Things, Smart Spaces, and Next Generation Networks and Systems, NEW2AN 2016/uSMART 2016, 2016, 9870 : 3 - 14
  • [28] Semantic Web Search Based on Ontological Conjunctive Queries
    Fazzinga, Bettina
    Gianforme, Giorgio
    Gottlob, Georg
    Lukasiewicz, Thomas
    FOUNDATIONS OF INFORMATION AND KNOWLEDGE SYSTEMS, PROCEEDINGS, 2010, 5956 : 153 - +
  • [29] Identifying Popular Search Goals behind Search Queries to Improve Web Search Ranking
    Wang Ting-Xuan
    Lu Wen-Hsiang
    INFORMATION RETRIEVAL TECHNOLOGY, 2011, 7097 : 250 - +
  • [30] Web Search Using Summarization on Clustered Web Documents Retrieved by User Queries
    Qumsiyeh, Rani
    Ng, Yiu-Kai
    2015 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), VOL 1, 2015, : 401 - 404