Web-Scale Semantic Product Search with Large Language Models

被引:3
|
作者
Muhamed, Aashiq [1 ]
Srinivasan, Sriram [1 ]
Teo, Choon-Hui [1 ]
Cui, Qingjun [1 ]
Zeng, Belinda [2 ]
Chilimbi, Trishul [2 ]
Vishwanathan, S. V. N. [1 ]
机构
[1] Amazon, Palo Alto, CA 94303 USA
[2] Amazon, Seattle, WA USA
关键词
Matching; Retrieval; Search; Pretrained Language Models;
D O I
10.1007/978-3-031-33380-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dense embedding-based semantic matching is widely used in e-commerce product search to address the shortcomings of lexical matching such as sensitivity to spelling variants. The recent advances in BERT-like language model encoders, have however, not found their way to realtime search due to the strict inference latency requirement imposed on e-commerce websites. While bi-encoder BERT architectures enable fast approximate nearest neighbor search, training them effectively on query-product data remains a challenge due to training instabilities and the persistent generalization gap with cross-encoders. In this work, we propose a four-stage training procedure to leverage large BERT-like models for product search while preserving low inference latency. We introduce query-product interaction pre-finetuning to effectively pretrain BERT bi-encoders for matching and improve generalization. Through offline experiments on an e-commerce product dataset, we show that a distilled small BERT-based model (75M params) trained using our approach improves the search relevance metric by up to 23% over a baseline DSSM-based model with similar inference latency. The small model only suffers a 3% drop in relevance metric compared to the 20x larger teacher. We also show using online A/B tests at scale, that our approach improves over the production model in exact and substitute products retrieved.
引用
收藏
页码:73 / 85
页数:13
相关论文
共 50 条
  • [11] Web-scale language-independent cataloging of noisy product listings for E-Commerce
    Das, Pradipto
    Xia, Yandi
    Levine, Aaron
    Di Fabbrizio, Giuseppe
    Datta, Ankur
    15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, 2017, 2 : 969 - 979
  • [12] Leveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing
    Heck, Larry
    Hakkani-Tur, Dilek
    Tur, Gokhan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1593 - 1597
  • [13] Web-Scale Language-Independent Cataloging of Noisy Product Listings for E-Commerce
    Das, Pradipto
    Xia, Yandi
    Levine, Aaron
    Di Fabbrizio, Giuseppe
    Datta, Ankur
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 969 - 979
  • [14] Neural Embedding Language Models in Semantic Clustering of Web Search Results
    Kutuzov, Andrey
    Kuzmenko, Elizaveta
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3044 - 3048
  • [15] Web-Scale Near-Duplicate Search: Techniques and Applications
    Ngo, Chong-Wah
    Xu, Changsheng
    Kraaij, Wessel
    El Saddik, Abdulmotaleb
    IEEE MULTIMEDIA, 2013, 20 (03) : 10 - 12
  • [16] Unifying Web-Scale Search and Reasoning from the Viewpoint of Granularity
    Zeng, Yi
    Wang, Yan
    Huang, Zhisheng
    Zhong, Ning
    ACTIVE MEDIA TECHNOLOGY, PROCEEDINGS, 2009, 5820 : 418 - +
  • [17] Web-Scale Datacenters
    Douglis, Fred
    IEEE INTERNET COMPUTING, 2014, 18 (04) : 13 - 14
  • [18] Automatic Web Image Annotation via Web-Scale Image Semantic Space Learning
    Xu, Hongtao
    Zhou, Xiangdong
    Lin, Lan
    Xiang, Yu
    Shi, Baile
    ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2009, 5446 : 211 - +
  • [19] WEBLENS: Towards Web-scale Data Integration, Training the Models
    Khan, Rituparna
    Gubanov, Michael
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5727 - 5729
  • [20] Semantic Web-Based Product Search
    Vandic, Damir
    Milea, Viorel
    ADVANCES IN CONCEPTUAL MODELING, ER 2013, 2014, 8697 : 150 - 159