Classification of Errors in Text

Authors
Busta, Jan [1]
Hlavackova, Dana [1]
Jakubicek, Milos [1]
Pala, Karel [1]
Affiliations
[1] Masaryk Univ, Fac Informat, Bot 68a, Brno 60200, Czech Republic
Keywords
errors in text; classification of errors
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
This paper presents two classifications of errors in Czech texts. As a basic resource we use the Chyby ("Errors") corpus, which has been under continuous development since 1999-2000 [1]. The corpus texts contain various kinds of errors: spelling, typographical, grammatical, semantic, lexical, and stylistic. These errors have been corrected manually and annotated according to a classification of errors (annotation scheme) developed for this purpose. For the annotation we implemented a tool named WinCorr. We mention the first annotation scheme and discuss the second one, which was designed recently to obtain a more adequate description of the errors occurring in texts. We also discuss the principles on which both classifications are based.
Pages: 109-119
Number of pages: 11