MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset

被引:0
|
作者
Liu, Yahui [1 ]
Yang, Haoping [1 ]
Gong, Chen [1 ]
Xia, Qingrong [1 ]
Li, Zhenghua [1 ]
Zhang, Min [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Inst Artificial Intelligence, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
During the past decade, neural network models have made tremendous progress on in-domain semantic role labeling (SRL). However, performance drops dramatically under the out-of-domain setting. In order to facilitate research on cross-domain SRL, this paper presents MuCPAD, a multi-domain Chinese predicate-argument dataset, which consists of 30,897 sentences and 92,051 predicates from six different domains. MuCPAD exhibits three important features. 1) Based on a frame-free annotation methodology, we avoid writing complex frames for new predicates. 2) We explicitly annotate omitted core arguments to recover more complete semantic structure, considering that omission of content words is ubiquitous in multi-domain Chinese texts. 3) We compile 53 pages of annotation guidelines and adopt strict double annotation for improving data quality. This paper describes in detail the annotation methodology and annotation process of MuCPAD, and presents in-depth data analysis. We also give benchmark results on cross-domain SRL based on MuCPAD.
引用
收藏
页码:1707 / 1717
页数:11
相关论文
共 50 条
  • [1] Predicate-argument relevance model for chinese-to-naxi SMT
    School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China
    不详
    J. Comput. Inf. Syst., 13 (4857-4861):
  • [2] Robust Integrated Models for Chinese Predicate-Argument Structure Analysis
    Luo Yanyan
    Masayuki, Asahara
    Yuji, Matsumoto
    CHINA COMMUNICATIONS, 2012, 9 (03) : 10 - 18
  • [3] The neural basis of predicate-argument structure
    Hurford, JR
    BEHAVIORAL AND BRAIN SCIENCES, 2003, 26 (03) : 261 - +
  • [4] DYNAMIC INTERPRETATION OF PREDICATE-ARGUMENT STRUCTURE
    Jezek, Elisabetta
    Pustejovsky, James
    LINGUE E LINGUAGGIO, 2019, 18 (02) : 179 - 207
  • [5] VERB SENSE DISAMBIGUATION BASED ON THESAURUS OF PREDICATE-ARGUMENT STRUCTURE An Evaluation of Thesaurus of Predicate-argument Structure for Japanese Verbs
    Takeuchi, Koichi
    Tsuchiyama, Suguru
    Moriya, Masato
    Moriyasu, Yuuki
    Satoh, Koichi
    KEOD 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND ONTOLOGY DEVELOPMENT, 2011, : 208 - 213
  • [6] Verb sense disambiguation based on thesaurus of predicate-argument structure: An evaluation of thesaurus of predicate-argument structure for Japanese verbs
    Takeuchi, Koichi
    Tsuchiyama, Suguru
    Moriya, Masato
    Moriyasu, Yuuki
    Satoh, Koichi
    KEOD 2011 - Proceedings of the International Conference on Knowledge Engineering and Ontology Development, 2011, : 208 - 213
  • [7] Parsing with generative models of predicate-argument structure
    Hockenmaier, J
    41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 359 - 366
  • [8] Experiments on the Identification of Predicate-Argument Structure in Polish
    Goluchowski, Konrad
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2014, 8686 : 185 - 192
  • [9] Dependency Tree Representations of Predicate-Argument Structures
    Qiu, Likun
    Zhang, Yue
    Zhang, Meishan
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2645 - 2651
  • [10] Exploring predicate-argument relations for named entity recognition in the molecular biology domain
    Wattarujeekrit, T
    Collier, N
    DISCOVERY SCIENCE, PROCEEDINGS, 2005, 3735 : 267 - 280