Annotation of multiword expressions in the Prague dependency treebank

Cited by: 0
Authors
Eduard Bejček
Pavel Straňák
Affiliation
[1] Charles University in Prague, Institute of Formal and Applied Linguistics
Source
Language Resources and Evaluation, 2010, 44 (1-2)
Keywords
Multiword expressions; Treebanks; Annotation; Inter-annotator agreement; Named entities
DOI
Not available
Abstract
We describe the annotation of multiword expressions (MWEs) in the Prague Dependency Treebank, which relies on several automatic pre-annotation steps. Representations of the MWEs are stored in a dictionary as subtrees of the treebank's tectogrammatical tree structures, and subsequent occurrences are pre-annotated automatically from these entries. We also show a way to measure the reliability of this type of annotation.
Pages: 7-21
Page count: 14
Related papers
50 in total
  • [1] Annotation of multiword expressions in the Prague dependency treebank
    Bejcek, Eduard
    Stranak, Pavel
    LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) : 7 - 21
  • [2] Annotation of discourse phenomena in the Prague Dependency Treebank
    Zikanova, Sarka
    Polakova, Lucie
    Jinova, Pavlina
    Nedoluzhko, Anna
    Rysova, Magdalena
    Mirovsky, Jiri
    Hajicova, Eva
    SLOVO A SLOVESNOST, 2015, 76 (03) : 163 - 197
  • [3] Prague Dependency Treebank Annotation Errors A Preliminary Analysis
    Kovar, Vojtech
    Jakubicek, Milos
    RASLAN 2009: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2009, : 101 - 108
  • [4] Post-annotation checking of Prague Dependency Treebank 2.0 data
    Stepanek, Jan
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 277 - 284
  • [5] Analyzing Text Coherence via Multiple Annotation in the Prague Dependency Treebank
    Rysova, Katerina
    Rysova, Magdalena
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 71 - 79
  • [6] Morphology Within the Multi-layered Annotation Scenario of the Prague Dependency Treebank
    Sevcikova, Magda
    SYSTEMS AND FRAMEWORKS FOR COMPUTATIONAL MORPHOLOGY (SFCM 2015), 2015, 537 : 1 - 26
  • [7] A Romanian Treebank Annotated with Verbal Multiword Expressions
    Mititelu, Verginica Barbu
    Cristescu, Mihaela
    Mitrofan, Maria
    Zgreaban, Bianca-Madalina
    Barbulescu, Elena-Andreea
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE COMPUTATIONAL LINGUISTICS IN BULGARIA, CLIB 2022, 2022, : 137 - 145
  • [8] Influence of Treebank Design on Representation of Multiword Expressions
    Bejcek, Eduard
    Stranak, Pavel
    Zeman, Daniel
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT I, 2011, 6608 : 1 - 14
  • [9] Prague Dependency Treebank - Consolidated 1.0
    Hajic, Jan
    Bejcek, Eduard
    Hlavacova, Jaroslava
    Mikulova, Marie
    Straka, Milan
    Stepanek, Jan
    Stepankova, Barbora
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5208 - 5218
  • [10] Prague Dependency Treebank: Restoration of deletions
    Hajicová, E
    Kruijff-Korbayová, I
    Sgall, P
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 44 - 49