The Chinese Discourse TreeBank: a Chinese corpus annotated with discourse relations

Authors
Yuping Zhou
Nianwen Xue
Affiliations
[1] Brandeis University
Keywords
Discourse TreeBank; Discourse relations; Chinese; Explicit and implicit discourse connectives
DOI
Not available
Abstract
The paper presents the Chinese Discourse TreeBank, a corpus annotated with Penn Discourse TreeBank-style discourse relations, each of which takes the form of a predicate with two arguments. We first characterize the syntactic and statistical distributions of Chinese discourse connectives, as well as the role of Chinese punctuation marks in discourse annotation, and then describe how we design our annotation strategy based on this characterization. The Chinese-specific features of our annotation strategy include annotating explicit and implicit discourse relations in a single pass, defining the argument labels on semantic rather than syntactic grounds, and annotating the semantic type of implicit discourse relations directly. We also introduce a flat, 11-valued semantic type classification scheme for discourse relations. Finally, we demonstrate the feasibility of our approach with evaluation results.
Pages: 397-431
Page count: 34
Related papers
50 in total
  • [31] Chinese Discourse Studies
    Gavriely-Nuri, Dalia
    DISCOURSE STUDIES, 2016, 18 (04) : 475 - 477
  • [32] Chinese Discourse Studies
    Chen, Sibo
    DISCOURSE & SOCIETY, 2016, 27 (02) : 244 - 246
  • [33] A Survey of Discourse Representations for Chinese Discourse Annotation
    Kang, Xiaomian
    Zong, Chengqing
    Xue, Nianwen
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (03)
  • [34] Chinese Discourse Studies
    Ba, Tianyi
    LANGUAGE IN SOCIETY, 2017, 46 (02) : 271 - 272
  • [35] Chinese Discourse Studies
    Aolan, Mailihaba
    SYSTEM, 2016, 57 : 155 - 156
  • [36] Using Discourse Information for Education with a Spanish-Chinese Parallel Corpus
    Cao, Shuyuan
    Gete, Harritxu
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2254 - 2261
  • [37] A corpus-based discourse analysis of conversational storytelling in Chinese adults
    Zhao, Yurong
    Zhao, Yang
    CHINESE LANGUAGE AND DISCOURSE, 2014, 5 (01) : 53 - 78
  • [38] Genres in the Prague Discourse Treebank
    Polakova, Lucie
    Jinova, Pavlina
    Mirovsky, Jiri
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1320 - 1326
  • [39] A Multi-layer Annotated Corpus of Argumentative Text: From Argument Schemes to Discourse Relations
    Musi, Elena
    Alhindi, Tariq
    Stede, Manfred
    Kriese, Leonard
    Muresan, Smaranda
    Rocci, Andrea
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1629 - 1636