Mining Related Queries from Query Logs Based on Linear Regression

被引:0
|
作者
Zhai, Haijun [1 ]
Zhang, Jin [2 ]
Wang, Xiaolei [2 ]
Zhang, Gang [2 ]
机构
[1] Univ Sci & Technol China, Dept Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
关键词
D O I
10.1109/FITME.2008.59
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper a novel linear regression model is proposed to mine related queries from query logs. Three types of association relationships between queries are identified and leveraged in our model, which include query session co-occurence, URL-clicked sharing and text similarity. Previous work directly applies part of these relations, which may be largely affected by the noise in query logs, such as the sparsity of click-through data, query-session segmentation errors and noisy clicks. In this work we propose linear regression analysis to identify effective features. In this way, we can effectively deal with the noise issue. The experiments demonstrate that the features identified with linear regression analysis are very effective. Moreover, the performance of our proposed linear regression model outperforms existing methods.
引用
收藏
页码:665 / +
页数:2
相关论文
共 50 条
  • [41] Accelerating Machine Learning Queries with Linear Algebra Query Processing
    Sun, Wenbo
    Katsifodimos, Asterios
    Hai, Rihan
    35TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2023, 2023,
  • [42] Accelerating machine learning queries with linear algebra query processing
    Sun, Wenbo
    Katsifodimos, Asterios
    Hai, Rihan
    DISTRIBUTED AND PARALLEL DATABASES, 2025, 43 (01)
  • [43] Extracting Semantic Relations from Query Logs;
    Baeza-Yates, Ricardo
    Tiberi, Alessandro
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 76 - 85
  • [44] An algorithm for skyline queries based on window query
    Yu, J
    Liu, X
    Liu, GH
    Proceedings of the 11th Joint International Computer Conference, 2005, : 267 - 270
  • [45] RebaCQ: Query Refinement Based on Consecutive Queries
    Hung, Chia-Hsin
    Tsai, Shuo-En
    Chen, Yi-Shin
    PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 366 - +
  • [46] Mining the Query Logs of a Chinese Web Search Engine for Character Usage Analysis
    Lu, Yan
    Chau, Michael
    Fang, Xiao
    PACIFIC ASIA CONFERENCE ON INFORMATION SYSTEMS 2006, SECTIONS 1-8, 2006, : 346 - +
  • [47] User k-anonymity for privacy preserving data mining of query logs
    Navarro-Arribas, Guillermo
    Torra, Vicenc
    Erola, Arnau
    Castella-Roca, Jordi
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (03) : 476 - 487
  • [48] A Novel Bipartite Graph Based Competitiveness Degree Analysis from Query Logs
    Wei, Qiang
    Qiao, Dandan
    Zhang, Jin
    Chen, Guoqing
    Guo, Xunhua
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2016, 11 (02)
  • [49] Similarity of temporal query logs based on ARIMA model
    Liu, Ning
    Nong, Shuzhen
    Yan, Jun
    Zhang, Benyu
    Chen, Zheng
    Li, Ying
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 975 - 979
  • [50] Mining Historic Query Trails to Label Long and Rare Search Engine Queries
    Bailey, Peter
    White, Ryen W.
    Liu, Han
    Kumaran, Giridhar
    ACM TRANSACTIONS ON THE WEB, 2010, 4 (04)