Using electronic texts for an annotated corpus building

Cited by: 9
Author
Galicia-Haro, SN [1 ]
Affiliation
[1] Inst Politecn Nacl, Computat Res Ctr, Nat Language & Text Proc Lab, Mexico City 07738, DF, Mexico
DOI
10.1109/ENC.2003.1232870
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Nowadays, collections of texts with annotations on several levels are useful resources. They are employed for diverse tasks in theoretical research and natural language applications. The most important collections are dedicated to English. However, huge efforts are required to develop the corresponding resource for other languages. In this work, we present the initial steps for the compilation of an annotated Mexican corpus using electronic texts obtained from the Web.
Pages: 26 - 32
Page count: 7
Related papers
50 in total
  • [1] Building a semantically annotated corpus of clinical texts
    Roberts, Angus
    Gaizauskas, Robert
    Hepple, Mark
    Demetriou, George
    Guo, Yikun
    Roberts, Ian
    Setzer, Andrea
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) : 950 - 966
  • [2] Implicit Knowledge in Argumentative Texts: An Annotated Corpus
    Becker, Maria
    Korfhage, Katharina
    Frank, Anette
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020: 2316 - 2324
  • [3] Towards a corpus of student texts annotated in discourse relations
    Bras, Myriam
    Vieu, Laure
    Joret, Maelle
    Pepin-Boutin, Audrey
    Poujade, Clamenca
    Roze, Charlotte
    LANGUE FRANCAISE, 2021, (211): 115 - 129
  • [4] Building a parallel bilingual syntactically annotated corpus
    Curín, J
    Cmejrek, M
    Havelka, J
    Kubon, V
    NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 168 - 176
  • [5] The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes
    Veronika Vincze
    György Szarvas
    Richárd Farkas
    György Móra
    János Csirik
    BMC Bioinformatics, 9
  • [6] The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes
    Vincze, Veronika
    Szarvas, Gyoergy
    Farkas, Richard
    Mora, Gyoergy
    Csirik, Janos
    BMC BIOINFORMATICS, 2008, 9 (Suppl 11)
  • [7] Building and processing a multilingual corpus of parallel texts
    Stahl, P
    PARALLEL CORPORA, PARALLEL WORLDS, 2002, (43): 169 - 179
  • [8] Building a Dialogue Corpus Annotated with Expressed and Experienced Emotions
    Ide, Tatsuya
    Kawahara, Daisuke
    Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2022: 21 - 30
  • [9] Building an Annotated Corpus for Text Summarization and Question Answering
    Varasai, Patcharee
    Pechsiri, Chaveevan
    Sukvari, Thana
    Satayamas, Vee
    Kawtrakul, Asanee
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008: 3427 - 3434
  • [10] Building a Dialogue Corpus Annotated with Expressed and Experienced Emotions
    Ide, Tatsuya
    Kawahara, Daisuke
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022: 21 - 30