TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus

Authors
Elena Álvarez-Mellado
María Luisa Díez-Platas
Pablo Ruiz-Fabo
Helena Bermúdez
Salvador Ros
Elena González-Blanco
Affiliations
[1] UNED University, Digital Humanities Innovation Lab (LINHD), School of Computer Science
[2] CoverWallet
Keywords
Named-entity annotation; Annotation scheme; Historical NER; Medieval named entities; Medieval Spanish corpus
DOI
Not available
Abstract
Medieval documents are a rich source of historical data, and performing named-entity recognition (NER) on this genre of texts can provide valuable historical evidence. However, traditional NER categories and schemes are usually designed with modern documents in mind (e.g., journalistic text), and general-domain NER annotation schemes fail to capture the nature of medieval entities. In this paper, we explore the challenges of performing named-entity annotation on a corpus of Spanish medieval documents: we discuss the mismatches that arise when traditional NER categories are applied to such a corpus, and we propose a novel humanist-friendly, TEI-compliant annotation scheme and guidelines intended to capture the particular nature of medieval entities.
Pages: 525-549
Page count: 24