QuadrupletBERT: An Efficient Model For Embedding-Based Large-Scale Retrieval

被引:0
|
作者
Liu, Peiyang [1 ,2 ]
Wang, Sen [3 ]
Wang, Xi [2 ]
Ye, Wei [1 ]
Zhang, Shikun [1 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing, Peoples R China
[2] Peking Univ, Sch Software & Microelectron, Beijing, Peoples R China
[3] PX Secur, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The embedding-based large-scale query-document retrieval problem is a hot topic in the information retrieval (IR) field. Considering that pre-trained language models like BERT have achieved great success in a wide variety of NLP tasks, we present a QuadrupletBERT model for effective and efficient retrieval in this paper. Unlike most existing BERT-style retrieval models, which only focus on the ranking phase in retrieval systems, our model makes considerable improvements to the retrieval phase and leverages the distances between simple negative and hard negative instances to obtaining better embeddings. Experimental results demonstrate that our QuadrupletBERT achieves state-of-the-art results in embedding-based large-scale retrieval tasks.
引用
收藏
页码:3734 / 3739
页数:6
相关论文
共 50 条
  • [41] Very Large-Scale Image Retrieval Based on Local Features
    Yin, Chang-Qing
    Mao, Wei
    Jiang, Wei
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 242 - +
  • [42] Large-Scale Image Retrieval with Elasticsearch
    Amato, Giuseppe
    Bolettieri, Paolo
    Carrara, Fabio
    Falchi, Fabrizio
    Gennaro, Claudio
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 925 - 928
  • [43] Large-Scale Image Retrieval Method Based on Vocabulary Tree
    Qi Jin
    Zhao Jian
    Xie Yu
    Chen Xiao-ning
    12TH ANNUAL MEETING OF CHINA ASSOCIATION FOR SCIENCE AND TECHNOLOGY ON INFORMATION AND COMMUNICATION TECHNOLOGY AND SMART GRID, 2010, : 219 - 223
  • [44] Large-Scale Multimedia Retrieval and Mining
    Yan, Rong
    Huet, Benoit
    Sukthankar, Rahul
    IEEE MULTIMEDIA, 2011, 18 (01) : 11 - 13
  • [45] COMPACT FEATURE BASED CLUSTERING FOR LARGE-SCALE IMAGE RETRIEVAL
    Liang, Yan
    Dong, Le
    Xie, Shanshan
    Lv, Na
    Xu, Zongyi
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2014,
  • [46] Component-Based Attention for Large-Scale Trademark Retrieval
    Tursun, Osman
    Denman, Simon
    Sivapalan, Sabesan
    Sridharan, Sridha
    Fookes, Clinton
    Mau, Sandra
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2350 - 2363
  • [47] Embedding-based retrieval: measures of threshold recall and precision to evaluate product search
    Krasnov, Fedor V.
    BIZNES INFORMATIKA-BUSINESS INFORMATICS, 2024, 18 (02): : 22 - 34
  • [48] Large-Scale Retrieval for Reinforcement Learning
    Humphreys, Peter C.
    Guez, Arthur
    Tieleman, Olivier
    Sifre, Laurent
    Weber, Theophane
    Lillicrap, Timothy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [49] Click-through-based Word Embedding for Large Scale Image Retrieval
    Chen, Yun
    Li, Victor O. K.
    2016 IEEE SECOND INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2016, : 145 - 148
  • [50] Efficient discrete supervised hashing for large-scale cross-modal retrieval
    Yao, Tao
    Han, Yaru
    Wang, Ruxin
    Kong, Xiangwei
    Yan, Lianshan
    Fu, Haiyan
    Tian, Qi
    NEUROCOMPUTING, 2020, 385 (385) : 358 - 367