Generative Retrieval with Semantic Tree-Structured Identifiers and Contrastive Learning

被引:0
|
作者
Si, Zihua [1 ]
Sun, Zhongxiang [1 ]
Chen, Jiale [2 ]
Chen, Guozhang [2 ]
Zang, Xiaoxue [2 ]
Zheng, Kai [2 ]
Song, Yang [2 ]
Zhang, Xiao [1 ]
Xu, Jun [1 ]
Gai, Kun
机构
[1] Renmin Univ China, Beijing, Peoples R China
[2] Kuaishou Technol Co Ltd, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Recommendation; Generative Retrieval; Contrastive Learning;
D O I
10.1145/3673791.3698408
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recommender systems, the retrieval phase is at the first stage and of paramount importance, requiring both effectiveness and very high efficiency. Recently, generative retrieval methods such as DSI and NCI, offering the benefit of end-to-end differentiability, have become an emerging paradigm for document retrieval with notable performance improvement, suggesting their potential applicability in recommendation scenarios. A fundamental limitation of these methods is their approach of generating item identifiers as text inputs, which fails to capture the intrinsic semantics of item identifiers as indices. The structural aspects of identifiers are only considered in construction and ignored during training. In addition, generative retrieval methods often generate imbalanced tree structures and yield identifiers with inconsistent lengths, leading to increased item inference time and sub-optimal performance. We introduce a novel generative retrieval framework named SEATER, which learns SEmAntic Tree-structured item identifiERs using an encoder-decoder structure. To optimize the structure of item identifiers, SEATER incorporates two contrastive learning tasks to ensure the alignment of token embeddings and the ranking orders of similar identifiers. In addition, SEATER devises a balanced k-ary tree structure of item identifiers, thus ensuring consistent semantic granularity and inference efficiency. Extensive experiments on three public datasets and an industrial dataset have demonstrated that SEATER outperforms a number of state-of-the-art models significantly.
引用
收藏
页码:154 / 163
页数:10
相关论文
共 50 条
  • [1] Generative Kernels for Tree-Structured Data
    Bacciu, Davide
    Micheli, Alessio
    Sperduti, Alessandro
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 4932 - 4946
  • [2] Tree-structured Curriculum Learning based on Semantic Similarity of Text
    Han, Sanggyu
    Myaeng, Sung-Hyon
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 971 - 976
  • [3] A framework for end-to-end learning on semantic tree-structured data
    Woof, William
    Chen, Ke
    arXiv, 2020,
  • [4] eTREE: Learning Tree-structured Embeddings
    Almutairi, Faisal M.
    Wang, Yunlong
    Wang, Dong
    Zhao, Emily
    Sidiropoulos, Nicholas D.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6609 - 6617
  • [5] Tree-structured conditional random fields for semantic annotation
    Tang, Jie
    Hong, Mingcai
    Li, Juanzi
    Liang, Bangyong
    SEMANTIC WEB - ISEC 2006, PROCEEDINGS, 2006, 4273 : 640 - 653
  • [6] Texture retrieval using tree-structured wavelet transform
    Zhou, SH
    Venkatesh, YV
    Ko, CC
    PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : A168 - A171
  • [7] A tree-structured image retrieval in the wavelet transform domain
    Sakji, S.
    Benazza-Benyahia, A.
    2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 1365 - 1368
  • [8] TREE-STRUCTURED KRONECKER CONVOLUTIONAL NETWORK FOR SEMANTIC SEGMENTATION
    Wu, Tianyi
    Tang, Sheng
    Zhang, Rui
    Cao, Juan
    Li, Jintao
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 940 - 945
  • [9] Learning Improved Semantic Representations with Tree-Structured LSTM for Hashtag Recommendation: An Experimental Study
    Zhu, Rui
    Yang, Delu
    Li, Yang
    INFORMATION, 2019, 10 (04)
  • [10] Visualization of tree-structured data through generative topographic mapping
    Gianniotis, Nikolaos
    Tino, Peter
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (08): : 1468 - 1493