COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at Amazon

被引:0
|
作者
Yu, Changlong [1 ]
Liu, Xin [1 ]
Maia, Jefferson [1 ]
Li, Yang [1 ]
Cao, Tianyu [1 ]
Gao, Yifan [1 ]
Song, Yangqiu [2 ]
Goutam, Rahul [1 ]
Zhang, Haiyang [1 ]
Yin, Bing [1 ]
Li, Zheng [1 ]
机构
[1] Amazon, Palo Alto, CA 94303 USA
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
10.1145/3626246.3653398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications of large-scale knowledge graphs in the e-commerce platforms can improve shopping experience for their customers. While existing e-commerce knowledge graphs (KGs) integrate a large volume of concepts or product attributes, they fail to discover user intentions, leaving the gap with how people think, behave, and interact with surrounding world. In this work, we present COSMO, a scalable system to mine user-centric commonsense knowledge from massive behaviors and construct industry-scale knowledge graphs to empower diverse online services. In particular, we describe a pipeline for collecting high-quality seed knowledge assertions that are distilled from large language models (LLMs) and further refined by critic classifiers trained over human-in-the-loop annotated data. Since those generations may not always align with human preferences and contain noises, we then describe how we adopt instruction tuning to finetune an efficient language model (COSMO-LM) for faithful e-commerce commonsense knowledge generation at scale. COSMO-LM effectively expands our knowledge graph to 18 major categories at Amazon, producing millions of high-quality knowledge with only 30k annotated instructions. Finally COSMO has been deployed in Amazon search applications such as search navigation. Both offline and online A/B experiments demonstrate our proposed system achieves significant improvement. Furthermore, these experiments highlight the immense potential of commonsense knowledge extracted from instruction-finetuned large language models.
引用
收藏
页码:148 / 160
页数:13
相关论文
共 50 条
  • [21] FAIR: Fraud Aware Impression Regulation System in Large-scale Real-time E-Commerce Search Platform
    Li, Zhao
    Song, Junshuai
    Hu, Shichang
    Ruan, Shasha
    Zhang, Long
    Hu, Zehong
    Gao, Jun
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1898 - 1903
  • [22] X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing
    Huang, Gui
    Cheng, Xuntao
    Wang, Jianying
    Wang, Yujie
    He, Dengcheng
    Zhang, Tieying
    Li, Feifei
    Wang, Sheng
    Cao, Wei
    Li, Qiang
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 651 - 665
  • [23] Machine Learning Clustering for Collaborative Filtering Recommendation of Large-Scale E-commerce in Cloud Computing
    Han, Ling-Mei
    Gao, Yan-Ping
    Liu, Jian-Guo
    Journal of Network Intelligence, 2023, 8 (04): : 1321 - 1337
  • [24] Research on Business Model Innovation of the Traditional Large-scale Retail Enterprises' Transition to the E-commerce
    Lv, Xiaoping
    Liu, Xiaoli
    PROCEEDING OF 2012 INTERNATIONAL SYMPOSIUM ON MANAGEMENT OF TECHNOLOGY (ISMOT'2012), 2012, : 652 - 656
  • [25] Large-Scale E-Commerce Image Retrieval with Top-Weighted Convolutional Neural Networks
    Zhao, Shichao
    Xu, Youjiang
    Han, Yahong
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 285 - 288
  • [26] Designing a Multi-Stage Transport System Serving e-Commerce Activity
    Burinskiene, Aurelija
    SUSTAINABILITY, 2021, 13 (11)
  • [27] Interactive Latent Knowledge Selection for E-commerce Product Copywriting Generation
    Wang, Zeming
    Zou, Yanyan
    Fang, Yuejian
    Chen, Hongshen
    Ma, Mian
    Ding, Zhuoye
    Long, Bo
    PROCEEDINGS OF THE 5TH WORKSHOP ON E-COMMERCE AND NLP (ECNLP 5), 2022, : 8 - 19
  • [28] DAliM: Machine Learning Based Intelligent Lucky Money Determination for Large-Scale E-Commerce Businesses
    Fu, Min
    Wong, Chi Man
    Zhu, Hai
    Huang, Yanjun
    Li, Yuanping
    Zheng, Xi
    Wu, Jia
    Yang, Jian
    Vong, Chi Man
    SERVICE-ORIENTED COMPUTING (ICSOC 2018), 2018, 11236 : 740 - 755
  • [29] SHOAL: Large-scale Hierarchical Taxonomy via Graph-based Query Coalition in E-commerce
    Li, Zhao
    Chen, Xia
    Pan, Xuming
    Zou, Pengcheng
    Li, Yuchen
    Yu, Guoxian
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 1858 - 1861
  • [30] Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval
    Chen, Yanzhe
    Zhong, Huasong
    He, Xiangteng
    Peng, Yuxin
    Cheng, Lele
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4939 - 4948