Error annotation: a review and faceted taxonomy

被引:0
|
作者
Eryigit, Guelsen [1 ,2 ]
Golynskaia, Anna [3 ]
Sayar, Elif [2 ]
Turker, Tolgahan [1 ]
机构
[1] Istanbul Tech Univ, Fac Comp & Informat, Istanbul, Turkiye
[2] Istanbul Tech Univ, Turkish Teaching Applicat & Res Ctr, Istanbul, Turkiye
[3] Yunus Emre Inst, Dept Educ, Ankara, Turkiye
关键词
Taxonomy; Learner corpus; Error classification;
D O I
10.1007/s10579-024-09794-0
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Classification of errors in language use plays a crucial role in language learning & teaching, error analysis studies, and language technology development. However, there is no standard and inclusive error classification method agreed upon among different disciplines, which causes repetition of similar efforts and a barrier in front of a common understanding in the field. This article brings a new and holistic perspective to error classifications and annotation schemes across different fields (i.e., learner corpora research, error analysis, grammar error correction, and machine translation), all serving the same purpose but employing different methods and approaches. The article first reviews previous error annotation efforts from different fields for nineteen languages with different characteristics, including the morphologically rich ones that pose diverse challenges for language technologies. It then introduces a faceted taxonomy for errors in language use, comprising multidimensional and hierarchical facets that can be utilized to create both fine- and coarse-grained error annotation schemes depending on specific requirements. We believe that the proposed taxonomy based on the principles of universality and diversity will address the emerging need for a common framework in error annotation.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Error Annotation of the Arabic Learner Corpus A New Error Tagset
    Alfaifi, Abdullah
    Atwell, Eric
    Abuhakema, Ghazi
    LANGUAGE PROCESSING AND KNOWLEDGE IN THE WEB, 2013, 8105 : 14 - 22
  • [32] Multiple-taxonomy question classification for category search on faceted information
    Tomas, David
    Vicedo, Jose L.
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 653 - 660
  • [33] Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction
    Bryant, Christopher
    Felice, Mariano
    Briscoe, Ted
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 793 - 805
  • [34] A Faceted Taxonomy for Rating Student Bibliographies in an Online Information Literacy Game
    Leeder, Chris
    Markey, Karen
    Yakel, Elizabeth
    COLLEGE & RESEARCH LIBRARIES, 2012, 73 (02): : 115 - 133
  • [35] Application of a new taxonomy for radiotherapy error reporting and results of a five-year review
    Bissonnette, J-P
    Medlam, G.
    MEDICAL PHYSICS, 2007, 34 (06) : 2420 - 2420
  • [36] Toward a Taxonomy of Public Health Error
    De Ville, Kenneth
    Novick, Lloyd F.
    JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE, 2010, 16 (03): : 216 - 220
  • [37] Linked Data and IIIF Integrating Taxonomy Management with Image Annotation
    Loh, Gene
    PROCEEDINGS OF THE 2017 PACIFIC NEIGHBORHOOD CONSORTIUM ANNUAL CONFERENCE AND JOINT MEETINGS (PNC), 2017, : 50 - 55
  • [38] Nearest-Neighbor Automatic Sound Annotation with a WordNet Taxonomy
    Pedro Cano
    Markus Koppenberger
    Sylvain Le Groux
    Julien Ricard
    Nicolas Wack
    Perfecto Herrera
    Journal of Intelligent Information Systems, 2005, 24 : 99 - 111
  • [39] Visual taxonomy for professional image retrieval and automated annotation of images
    Nurminen, Tuui
    2007 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, PROCEEDINGS, 2007, : 181 - 185
  • [40] Enhanced taxonomy annotation of antiviral activity data from ChEMBL
    Nikitina, Anastasia A.
    Orlov, Alexey A.
    Kozlovskaya, Liubov I.
    Palyulin, Vladimir A.
    Osolodkin, Dmitry I.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2019,