LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search

被引:3
|
作者
Li, Yuchen [1 ]
Xiong, Haoyi [2 ]
Kong, Linghe [1 ]
Wang, Shuaiqiang [2 ]
Sun, Zeyi [3 ]
Chen, Hongyang [3 ]
Chen, Guihai [1 ]
Yin, Dawei [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
[3] Zhejiang Lab, Hangzhou, Peoples R China
基金
上海市科技启明星计划; 国家重点研发计划;
关键词
Learning to Rank; Graph Convolutional Networks; Web Search;
D O I
10.1007/978-3-031-43427-3_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While traditional Learning to Rank (LTR) models use query-webpage pairs to perform regression tasks to predict the ranking scores, they usually fail to capture the structure of interactions between queries and webpages over an extremely large bipartite graph. In recent years, Graph Convolutional Neural Networks (GCNs) have demonstrated their unique advantages in link prediction over bipartite graphs and have been successfully used for user-item recommendations. However, it is still difficult to scale-up GCNs for web search, due to the (1) extreme sparsity of links in query-webpage bipartite graphs caused by the expense of ranking scores annotation and (2) imbalance between queries (billions) and web-pages (trillions) for web-scale search as well as the imbalance in annotations. In this work, we introduce the Q-subgraph and W-subgraph to represent every query and webpage with the structure of interaction preserved, and then propose LtrGCN-an LTR pipeline that samples Q-subgraphs and W-subgraphs from all query-webpage pairs, learns to extract features from Q-subgraphs and W-subgraphs, and predict ranking scores in an end-to-end manner. We carried out extensive experiments to evaluate LtrGCN using two real-world datasets and online experiments based on the A/B test at a large-scale search engine. The offline results show that LtrGCN could achieve Delta NDCG(5) = 2.89%-3.97% compared to baselines. We deploy LtrGCN with realistic traffic at a large-scale search engine, where we can still observe significant improvement. LtrGCN performs consistently in both offline and online experiments.
引用
收藏
页码:635 / 651
页数:17
相关论文
共 50 条
  • [21] A graph-based cache for large-scale similarity search engines
    Gil-Costa, Veronica
    Marin, Mauricio
    Bonacic, Carolina
    Solar, Roberto
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (05): : 2006 - 2034
  • [22] Graph convolutional neural networks-based assessment of students' collaboration ability
    Lin, Jinjiao
    Gao, Tianqi
    Wen, Yuhua
    Yu, Xianmiao
    You, Bizhen
    Yin, Yanfang
    Zhao, Yanze
    Pu, Haitao
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28):
  • [23] On the Large-Scale Transferability of Convolutional Neural Networks
    Zheng, Liang
    Zhao, Yali
    Wang, Shengjin
    Wang, Jingdong
    Yang, Yi
    Tian, Qi
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 27 - 39
  • [24] Visual Odometry Based on Convolutional Neural Networks for Large-Scale Scenes
    Meng, Xuyang
    Fan, Chunxiao
    Ming, Yue
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [25] A Survey of Large-Scale Graph Neural Networks
    Xiao G.-Q.
    Li X.-Q.
    Chen Y.-D.
    Tang Z.
    Jiang W.-J.
    Li K.-L.
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (01): : 148 - 171
  • [26] A Large Scale Search Dataset for Unbiased Learning to Rank
    Zou, Lixin
    Mao, Haitao
    Chu, Xiaokai
    Tang, Jiliang
    Wang, Shuaiqiang
    Ye, Wenwen
    Yin, Dawei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [27] Performance of Graph Reconstruction Method for Large-Scale Web Graph Analysis
    Takei, Ryota
    Niimi, Ayahiko
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2852 - 2854
  • [28] Improved Graph Convolutional Neural Networks-based Cellular Network Fault Diagnosis
    Gao, Zongzhen
    Liu, Wenlai
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2025, 27 (02):
  • [29] Large-scale knowledge graph representation learning
    Badrouni, Marwa
    Katar, Chaker
    Inoubli, Wissem
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (09) : 5479 - 5499
  • [30] A Sampling-Based Graph Clustering Algorithm for Large-Scale Networks
    Zhang J.-P.
    Chen H.-C.
    Wang K.
    Zhu K.-J.
    Wang Y.-W.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (08): : 1731 - 1737