An Annotated Corpus of Direct Speech

被引:0
|
作者
Lee, John [1 ]
Yeung, Chak Yan [1 ]
机构
[1] City Univ Hong Kong, Halliday Ctr Intelligent Applicat Language Studie, Dept Linguist & Translat, Hong Kong, Peoples R China
关键词
direct speech; coreference; corpus annotation;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
We propose a scheme for annotating direct speech in literary texts, based on the Text Encoding Initiative (TEI) and the coreference annotation guidelines from the Message Understanding Conference (MUC). The scheme encodes the speakers and listeners of utterances in a text, as well as the quotative verbs that reports the utterances. We measure inter-annotator agreement on this annotation task. We then present statistics on a manually annotated corpus that consists of books from the New Testament. Finally, we visualize the corpus as a conversational network.
引用
收藏
页码:1059 / 1063
页数:5
相关论文
共 50 条
  • [31] An Annotated Social Media Corpus for German
    Bick, Eckhard
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6127 - 6135
  • [32] FactBank: a corpus annotated with event factuality
    Sauri, Roser
    Pustejovsky, James
    LANGUAGE RESOURCES AND EVALUATION, 2009, 43 (03) : 227 - 268
  • [33] BAAC: Bangor Arabic Annotated Corpus
    Alkhazi, Ibrahim S.
    Teahan, William J.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 131 - 140
  • [34] NegPar: a parallel corpus annotated for negation
    Liu, Qianchu
    Fancellu, Federico
    Webber, Bonnie
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3464 - 3472
  • [35] LINGUISTICALLY ANNOTATED SPOKEN NGANASAN CORPUS
    Beata, Wagner-Nagy
    Sandor, Szeverenyi
    TOMSKII ZHURNAL LINGVISTICHESKIKH I ANTROPOLOGICHESKIKH ISSLEDOVANII-TOMSK JOURNAL OF LINGUISTICS AND ANTHROPOLOGY, 2015, (02): : 25 - 34
  • [36] An annotated corpus with nanomedicine and pharmacokinetic parameters
    Lewinski, Nastassja A.
    Jimenez, Ivan
    McInnes, Bridget T.
    INTERNATIONAL JOURNAL OF NANOMEDICINE, 2017, 12 : 7519 - 7527
  • [37] Corpus Linguistics and Linguistically Annotated Corpora
    McCallum, Lee
    ARAB WORLD ENGLISH JOURNAL, 2016, 7 (01) : 521 - 524
  • [38] Error-Annotated Corpus of Latvian
    Deksne, Daiga
    Skadina, Inguna
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2014, 2014, 268 : 163 - 166
  • [39] Sense-Annotated Corpus for Russian
    Kirillovich, Alexander
    Loukachevitch, Natalia
    Kulaev, Maksim
    Bolshina, Angelina
    Ilvovsky, Dmitry
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA, CLIB 2022, 2022, : 130 - 136
  • [40] Corpus Linguistics and Linguistically Annotated Corpora
    Xiao-Desai, Yang
    Kuebler, Sandra
    MODERN LANGUAGE JOURNAL, 2015, 99 (04): : 801 - 802