QuadrupletBERT: An Efficient Model For Embedding-Based Large-Scale Retrieval

被引:0
|
作者
Liu, Peiyang [1 ,2 ]
Wang, Sen [3 ]
Wang, Xi [2 ]
Ye, Wei [1 ]
Zhang, Shikun [1 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing, Peoples R China
[2] Peking Univ, Sch Software & Microelectron, Beijing, Peoples R China
[3] PX Secur, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The embedding-based large-scale query-document retrieval problem is a hot topic in the information retrieval (IR) field. Considering that pre-trained language models like BERT have achieved great success in a wide variety of NLP tasks, we present a QuadrupletBERT model for effective and efficient retrieval in this paper. Unlike most existing BERT-style retrieval models, which only focus on the ranking phase in retrieval systems, our model makes considerable improvements to the retrieval phase and leverages the distances between simple negative and hard negative instances to obtaining better embeddings. Experimental results demonstrate that our QuadrupletBERT achieves state-of-the-art results in embedding-based large-scale retrieval tasks.
引用
收藏
页码:3734 / 3739
页数:6
相关论文
共 50 条
  • [31] Large-scale Image Retrieval based on the Vocabulary Tree
    Cheng, Bo
    Zhuo, Li
    Zhang, Pei
    Zhang, Jing
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 299 - 304
  • [32] Improving Embedding-Based Retrieval in Friend Recommendation with ANN Query Expansion
    Kung, Pau Perng-Hwa
    Fan, Zihao
    Zhao, Tong
    Liu, Yozen
    Lai, Zhixin
    Shi, Jiahui
    Wu, Yan
    Yu, Jun
    Shah, Neil
    Venkataraman, Ganesh
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2930 - 2934
  • [33] An efficient radix trie-based semantic visual indexing model for large-scale image retrieval in cloud environment
    Krishnaraj, N.
    Elhoseny, Mohamed
    Lydia, E. Laxmi
    Shankar, K.
    ALDabbas, Omar
    SOFTWARE-PRACTICE & EXPERIENCE, 2021, 51 (03): : 489 - 502
  • [34] Iterative Manifold Embedding Layer Learned by Incomplete Data for Large-Scale Image Retrieval
    Xu, Jian
    Wang, Chunheng
    Qi, Chengzuo
    Shi, Cunzhao
    Xiao, Baihua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (06) : 1551 - 1562
  • [35] Large-Scale Heterogeneous Feature Embedding
    Huang, Xiao
    Song, Qingquan
    Yang, Fan
    Hu, Xia
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3878 - 3885
  • [36] Large Margin Graph Embedding-Based Discriminant Dimensionality Reduction
    Tian, Yanjia
    Feng, Xiang
    SCIENTIFIC PROGRAMMING, 2021, 2021 (2021)
  • [37] Large-Scale Fingerprint Data Retrieval Based C-Means Clustering Model
    Wang, Decai
    Zhang, Weibing
    Chang, Xia
    Gao, Yuelin
    2023 11TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: IOT AND SMART CITY, ITIOTSC 2023, 2023, : 5 - 9
  • [38] Spectral embedding-based multiview features fusion for content-based image retrieval
    Feng, Lin
    Yu, Laihang
    Zhu, Hai
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (05)
  • [39] Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance
    Nguyen, Manh-Duy
    Nguyen, Binh T.
    Gurrin, Cathal
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (02)
  • [40] Large-Scale Image Retrieval Based on Compressed Camera Identification
    Valsesia, Diego
    Coluccia, Giulio
    Bianchi, Tiziano
    Magli, Enrico
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (09) : 1439 - 1449