Error annotation: a review and faceted taxonomy

被引:0
|
作者
Eryigit, Guelsen [1 ,2 ]
Golynskaia, Anna [3 ]
Sayar, Elif [2 ]
Turker, Tolgahan [1 ]
机构
[1] Istanbul Tech Univ, Fac Comp & Informat, Istanbul, Turkiye
[2] Istanbul Tech Univ, Turkish Teaching Applicat & Res Ctr, Istanbul, Turkiye
[3] Yunus Emre Inst, Dept Educ, Ankara, Turkiye
关键词
Taxonomy; Learner corpus; Error classification;
D O I
10.1007/s10579-024-09794-0
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Classification of errors in language use plays a crucial role in language learning & teaching, error analysis studies, and language technology development. However, there is no standard and inclusive error classification method agreed upon among different disciplines, which causes repetition of similar efforts and a barrier in front of a common understanding in the field. This article brings a new and holistic perspective to error classifications and annotation schemes across different fields (i.e., learner corpora research, error analysis, grammar error correction, and machine translation), all serving the same purpose but employing different methods and approaches. The article first reviews previous error annotation efforts from different fields for nineteen languages with different characteristics, including the morphologically rich ones that pose diverse challenges for language technologies. It then introduces a faceted taxonomy for errors in language use, comprising multidimensional and hierarchical facets that can be utilized to create both fine- and coarse-grained error annotation schemes depending on specific requirements. We believe that the proposed taxonomy based on the principles of universality and diversity will address the emerging need for a common framework in error annotation.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] A faceted taxonomy of semantic integrity constraints for the XML data model
    Rodrigues, Khaue Rezende
    Mello, Ronaldo dos Santos
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, 4653 : 65 - +
  • [22] MODEL FOR CONCEPTUAL ANALYSIS' ON THESIS AND ISSERTATIONS FOR THE FACETED TAXONOMY CREATION
    Moreira dos Santos Maculan, Benildes Coura
    Borem de Oliveira Lima, Gercina Angela
    INFORMACAO & SOCIEDADE-ESTUDOS, 2011, 21 (03) : 41 - 54
  • [23] TOWARDS A FACETED TAXONOMY TO STRUCTURE WEB-GENRE CORPORA
    Ezeiza Ramos, Joseba
    Paya Ruiz, Xabier
    Elordui Urkiza, Agurtzane
    Epelde Pagola, Imanol
    REVISTA DE LINGUISTICA Y LENGUAS APLICADAS, 2011, 6 : 139 - 150
  • [24] Concept naming vs concept categorisation: a faceted approach to semantic annotation
    Prasad, A. R. D.
    Guha, Nabonita
    ONLINE INFORMATION REVIEW, 2008, 32 (04) : 500 - 510
  • [25] French error type annotation for dictation: A platform with automatic error type annotation for French dictation exercises
    Qin, Ying
    Luo, Yumeng
    Zhai, Yuming
    FRONTIERS IN PSYCHOLOGY, 2023, 13
  • [26] Patient Error: A Preliminary Taxonomy
    Buetow, Stephen
    Kiata, Liz
    Liew, Tess
    Kenealy, Tim
    Dovey, Susan
    Elwyn, Glyn
    ANNALS OF FAMILY MEDICINE, 2009, 7 (03) : 223 - 231
  • [27] Evaluating a medical error taxonomy
    Brixey, J
    Johnson, TR
    Zhang, JJ
    AMIA 2002 SYMPOSIUM, PROCEEDINGS: BIOMEDICAL INFORMATICS: ONE DISCIPLINE, 2002, : 71 - 75
  • [28] Medication error taxonomy available
    不详
    AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 1999, 56 (04) : 314 - 314
  • [29] Error annotation in a Learner Corpus of Portuguese
    del Rio, Iria
    Mendes, Amalia
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4116 - 4119
  • [30] Towards a taxonomy for annotation of data science experiment repositories
    Sivasothy, Shangeetha
    Barnett, Scott
    Fernando, Niroshinie
    Vasa, Rajesh
    Sinha, Roopak
    Simmons, Andrew
    IEEE 21ST INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM 2021), 2021, : 76 - 80