The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service

被引:0
|
作者
Chen, Meng [1 ]
Liu, Ruixue [1 ]
Shen, Lei [1 ]
Yuan, Shaozu [1 ]
Zhou, Jingyan [1 ]
Wu, Youzheng [1 ]
He, Xiaodong [1 ]
Zhou, Bowen [1 ]
机构
[1] JD AI, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020) | 2020年
关键词
large-scale dataset; multi-turn dialogues; real E-commerce scenario;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Human conversations are complicated and building a human-like dialogue agent is an extremely challenging task. With the rapid development of deep learning techniques, data-driven models become more and more prevalent which need a huge amount of real conversation data. In this paper, we construct a large-scale real scenario Chinese E-commerce conversation corpus, JDDC, with more than 1 million multi-turn dialogues, 20 million utterances, and 150 million words. The dataset reflects several characteristics of human-human conversations, e.g., goal-driven, and long-term dependency among the context. It also covers various dialogue types including task-oriented, chitchat and question-answering. Extra intent information and three well-annotated challenge sets are also provided. Then, we evaluate several retrieval-based and generative models to provide basic benchmark performance on the JDDC corpus. And we hope JDDC can serve as an effective testbed and benefit the development of fundamental research in dialogue task.
引用
收藏
页码:459 / 466
页数:8
相关论文
共 50 条
  • [21] Research on transaction security detection algorithm for Large-scale e-commerce website
    Hu Guixiang
    Qian Xinjie
    Fu Qiulin
    Yang Bo
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ELECTRONIC TECHNOLOGY, 2015, 6 : 262 - 265
  • [22] e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce
    Shin, Wonyoung
    Park, Jonghun
    Woo, Taekang
    Cho, Yongwoo
    Oh, Kwangjin
    Song, Hwanjun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3484 - 3494
  • [23] ICS-Assist: Intelligent Customer Inquiry Resolution Recommendation in Online Customer Service for Large E-Commerce Businesses
    Fu, Min
    Guan, Jiwei
    Zheng, Xi
    Zhou, Jie
    Lu, Jianchao
    Zhang, Tianyi
    Zhuo, Shoujie
    Zhan, Lijun
    Yang, Jian
    SERVICE-ORIENTED COMPUTING (ICSOC 2020), 2020, 12571 : 370 - 385
  • [24] Study of Large-scale Enterprise Strategic Decision Support System in E-commerce Environment
    Jiang, Yuantao
    2011 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL, AND SYSTEMS SCIENCES, AND ENGINEERING (CESSE 2011), 2011, : 528 - 531
  • [25] A linguistic solution for double large-scale group decision-making in E-commerce
    Wu, Tong
    Liu, Xinwang
    Qin, Jindong
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 116 : 97 - 112
  • [26] Building Large-Scale Deep Learning System for Entity Recognition in E-Commerce Search
    Wen, Musen
    Vasthimal, Deepak Kumar
    Lu, Alan
    Wang, Tian
    Guo, Aimin
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 149 - 154
  • [27] Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks
    Ha, Jung-Woo
    Pyo, Hyuna
    Kim, Jeonghee
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 107 - 115
  • [28] A large-scale last-mile consolidation model for e-commerce home delivery
    Munoz-Villamizar, Andres
    Velazquez-Martinez, Josue C.
    Caballero-Caballero, Sergio
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [29] Hierarchical Bipartite Graph Neural Networks: Towards Large-Scale E-commerce Applications
    Li, Zhao
    Shen, Xin
    Jiao, Yuhang
    Pan, Xuming
    Zou, Pengcheng
    Meng, Xianling
    Yao, Chengwei
    Bu, Jiajun
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1677 - 1688
  • [30] CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
    Zhu, Qi
    Huang, Kaili
    Zhang, Zheng
    Zhu, Xiaoyan
    Huang, Minlie
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 (08) : 281 - 295