Intent Term Weighting in E-Commerce Queries

被引:7
|
作者
Manchanda, Saurav [1 ,2 ]
Sharma, Mohit [2 ]
Karypis, George [1 ]
机构
[1] Univ Minnesota, Minneapolis, MN 55455 USA
[2] WalmartLabs, Sunnyvale, CA USA
基金
美国国家科学基金会;
关键词
Term weighting; query intent; query refinement; query reformulation;
D O I
10.1145/3357384.3358151
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
E-commerce search engines can fail to retrieve results that satisfy a query's product intent because: (i) conventional retrieval approaches, such as BM25, may ignore the important terms in queries owing to their low inverse document frequency (IDF), and (ii) for long queries, as is usually the case in rare queries (i.e., tail queries), they may fail to determine the relevant terms that are representative of the query's product intent. In this paper, we lever-age the historical query reformulation logs of a large e-retailer (walmart.com) to develop a distant-supervision-based approach to identify the relevant terms that characterize the query's product intent. The key idea underpinning our approach is that the terms retained in the reformulation of a query are more important in describing the query's product intent than the discarded terms. Additionally, we also use the fact that the significance of a term depends on its context (other terms in the neighborhood) in the query to determine the term's importance towards the query's product intent. We show that identifying and emphasizing the terms that define the query's product intent leads to a 3% improvement in ranking and outperforms the context-unaware baselines.
引用
收藏
页码:2345 / 2348
页数:4
相关论文
共 50 条