NaturalConv: A Chinese Dialogue Dataset Towards Multi-turn Topic-driven Conversation

被引:0
|
作者
Wang, Xiaoyang [1 ]
Li, Chen [1 ]
Zhao, Jianqiao [1 ]
Yu, Dong [1 ]
机构
[1] Tencent AI Lab, Bellevue, WA 98004 USA
来源
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021年 / 35卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a Chinese multi-turn topic-driven conversation dataset, NaturalConv, which allows the participants to chat anything they want as long as any element from the topic is mentioned and the topic shift is smooth. Our corpus contains 19.9K conversations from six domains, and 400K utterances with an average turn number of 20.1. These conversations contain in-depth discussions on related topics or widely natural transition between multiple topics. We believe either way is normal for human conversation. To facilitate the research on this corpus, we provide results of several benchmark models. Comparative results show that for this dataset, our current models are not able to provide significant improvement by introducing background knowledge/topic. Therefore, the proposed dataset should be a good benchmark for further research to evaluate the validity and naturalness of multi-turn conversation systems. Our dataset is available at https://ai.tencent.com/ailab/nlp/dialogue/#datasets.
引用
收藏
页码:14006 / 14014
页数:9
相关论文
共 50 条
  • [1] MuTual: A Dataset for Multi-Turn Dialogue Reasoning
    Cui, Leyang
    Wu, Yu
    Liu, Shujie
    Zhang, Yue
    Zhou, Ming
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1406 - 1416
  • [2] MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation
    Feng, Jiazhan
    Sun, Qingfeng
    Xu, Can
    Zhao, Pu
    Yang, Yaming
    Tao, Chongyang
    Zhao, Dongyan
    Lin, Qingwei
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7348 - 7363
  • [3] Topic-Aware Multi-turn Dialogue Modeling
    Xu, Yi
    Zhao, Hai
    Zhang, Zhuosheng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14176 - 14184
  • [4] A Practical Dialogue-Act-Driven Conversation Model for Multi-Turn Response Selection
    Kumar, Harshit
    Agarwal, Arvind
    Joshi, Sachindra
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1980 - 1989
  • [5] MatDC: A Multi-turn Multi-domain Annotated Task-oriented Dialogue Dataset in Chinese
    Tseng, Yu-Hsiang
    Hsieh, Shu-Kai
    Lian, Richard
    Chiang, Chiung-Yu
    Chang, Yu-Lin
    Chang, Li-Ping
    Hsieh, Ji-Lung
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 165 - 170
  • [6] ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation
    Lovenia, Holy
    Cahyawijaya, Samuel
    Winata, Genta Indra
    Xu, Peng
    Yan, Xu
    Liu, Zihan
    Frieske, Rita
    Yu, Tiezheng
    Dai, Wenliang
    Barezi, Elham J.
    Chen, Qifeng
    Ma, Xiaojuan
    Shi, Bertram E.
    Fung, Pascale
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 7259 - 7268
  • [7] Multi-turn dialogue comprehension from a topic-aware perspective
    Ma, Xinbei
    Xu, Yi
    Zhao, Hai
    Zhang, Zhuosheng
    NEUROCOMPUTING, 2024, 578
  • [8] Hierarchical latent variables structure for topic aware multi-turn conversation
    Cui, Fuwei
    Di, Hui
    Huang, Hui
    Ouchi, Kazushige
    Liu, Ze
    Xu, Jinan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (03) : 3805 - 3814
  • [9] Multi-turn Dialogue Generation Model with Dialogue Structure
    Jiang X.-T.
    Wang Z.-Q.
    Li S.-S.
    Zhou G.-D.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (11): : 4239 - 4250
  • [10] Topic-level knowledge sub-graphs for multi-turn dialogue generation
    Li, Jing
    Huang, Qingbao
    Cai, Yi
    Liu, Yongkang
    Fu, Mingyi
    Li, Qing
    KNOWLEDGE-BASED SYSTEMS, 2021, 234