LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search

被引：3

作者：

Li, Yuchen ^{[1
]}

Xiong, Haoyi ^{[2
]}

Kong, Linghe ^{[1
]}

Wang, Shuaiqiang ^{[2
]}

Sun, Zeyi ^{[3
]}

Chen, Hongyang ^{[3
]}

Chen, Guihai ^{[1
]}

Yin, Dawei ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] Baidu Inc, Beijing, Peoples R China

[3] Zhejiang Lab, Hangzhou, Peoples R China

来源：

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI | 2023年 / 14174卷

基金：

上海市科技启明星计划; 国家重点研发计划;

关键词：

Learning to Rank; Graph Convolutional Networks; Web Search;

D O I：

10.1007/978-3-031-43427-3_38

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While traditional Learning to Rank (LTR) models use query-webpage pairs to perform regression tasks to predict the ranking scores, they usually fail to capture the structure of interactions between queries and webpages over an extremely large bipartite graph. In recent years, Graph Convolutional Neural Networks (GCNs) have demonstrated their unique advantages in link prediction over bipartite graphs and have been successfully used for user-item recommendations. However, it is still difficult to scale-up GCNs for web search, due to the (1) extreme sparsity of links in query-webpage bipartite graphs caused by the expense of ranking scores annotation and (2) imbalance between queries (billions) and web-pages (trillions) for web-scale search as well as the imbalance in annotations. In this work, we introduce the Q-subgraph and W-subgraph to represent every query and webpage with the structure of interaction preserved, and then propose LtrGCN-an LTR pipeline that samples Q-subgraphs and W-subgraphs from all query-webpage pairs, learns to extract features from Q-subgraphs and W-subgraphs, and predict ranking scores in an end-to-end manner. We carried out extensive experiments to evaluate LtrGCN using two real-world datasets and online experiments based on the A/B test at a large-scale search engine. The offline results show that LtrGCN could achieve Delta NDCG(5) = 2.89%-3.97% compared to baselines. We deploy LtrGCN with realistic traffic at a large-scale search engine, where we can still observe significant improvement. LtrGCN performs consistently in both offline and online experiments.

引用

页码：635 / 651

页数：17

共 50 条

[21] A graph-based cache for large-scale similarity search engines
Gil-Costa, Veronica
Marin, Mauricio
Bonacic, Carolina
Solar, Roberto
JOURNAL OF SUPERCOMPUTING, 2018, 74 (05): : 2006 - 2034
[22] Graph convolutional neural networks-based assessment of students' collaboration ability
Lin, Jinjiao
Gao, Tianqi
Wen, Yuhua
Yu, Xianmiao
You, Bizhen
Yin, Yanfang
Zhao, Yanze
Pu, Haitao
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28):
[23] On the Large-Scale Transferability of Convolutional Neural Networks
Zheng, Liang
Zhao, Yali
Wang, Shengjin
Wang, Jingdong
Yang, Yi
Tian, Qi
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 27 - 39
[24] Visual Odometry Based on Convolutional Neural Networks for Large-Scale Scenes
Meng, Xuyang
Fan, Chunxiao
Ming, Yue
TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
[25] A Survey of Large-Scale Graph Neural Networks
Xiao G.-Q.
Li X.-Q.
Chen Y.-D.
Tang Z.
Jiang W.-J.
Li K.-L.
Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (01): : 148 - 171
[26] A Large Scale Search Dataset for Unbiased Learning to Rank
Zou, Lixin
Mao, Haitao
Chu, Xiaokai
Tang, Jiliang
Wang, Shuaiqiang
Ye, Wenwen
Yin, Dawei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[27] Performance of Graph Reconstruction Method for Large-Scale Web Graph Analysis
Takei, Ryota
Niimi, Ayahiko
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2852 - 2854
[28] Improved Graph Convolutional Neural Networks-based Cellular Network Fault Diagnosis
Gao, Zongzhen
Liu, Wenlai
EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2025, 27 (02):
[29] Large-scale knowledge graph representation learning
Badrouni, Marwa
Katar, Chaker
Inoubli, Wissem
KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (09) : 5479 - 5499
[30] A Sampling-Based Graph Clustering Algorithm for Large-Scale Networks
Zhang J.-P.
Chen H.-C.
Wang K.
Zhu K.-J.
Wang Y.-W.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (08): : 1731 - 1737

← 1 2 3 4 5 →