Annotating Attribution Relations in Arabic

Cited by: 0
Authors
Alsaif, Amal [1 ]
Alyahya, Tasniem [1 ]
Alotaibi, Madawi [1 ]
Almuzaini, Huda [1 ]
Algahtani, Abeer [1 ]
Affiliations
[1] Al Imam Mohammad Ibn Saud Islamic Univ, Coll Comp Sci & Informat, Riyadh, Saudi Arabia
Keywords
attribution; annotation tool; NLP; Arabic discourse; annotation guidelines; ATB; inter-annotator agreement; CORPUS;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Subject classification code
081203 ; 0835 ;
Abstract
We present a first empirical effort to annotate attribution in Modern Standard Arabic (MSA). Identifying the arguments attributed to a source has been applied successfully in diverse systems such as authorship identification, information retrieval, and opinion mining. Current studies focus on using lexical terms in long texts to verify, for example, the identity of the author. Attribution identification in short texts, however, remains completely unexplored, owing on the one hand to the lack of resources such as annotated corpora and tools, especially for Arabic, and on the other hand to the limited coverage of different attribution usages in the Arabic literature. The paper presents our guidelines for annotating the attribution elements (cue, source, and content), with the required syntactic and semantic features, in Arabic news text (the Arabic TreeBank, ATB), in light of earlier studies for other languages and with all required adaptations. We also develop a new annotation tool for attribution in Arabic to ensure that all instances of attribution are reliably annotated. The results of a pilot annotation are discussed, along with inter-annotator agreement studies, as a step towards creating the first gold-standard attribution corpus for Arabic.
Pages: 4008-4015
Number of pages: 8
Related papers
50 in total
  • [1] Annotating Attribution Relations: Towards an Italian Discourse Treebank
    Pareti, Silvia
    Prodanof, Irina
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3566 - 3571
  • [2] Annotating an Arabic Learner Corpus for Error
    Abuhakema, Ghazi
    Faraj, Reem
    Feldman, Anna
    Fitzpatrick, Eileen
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1347 - 1350
  • [3] The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic
    Al-Saif, Amal
    Markert, Katja
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2046 - 2053
  • [4] Annotating Attribution in Czech News Server Articles
    Hladka, Barbora
    Mirovsky, Jiri
    Kopp, Matyas
    Moravec, Vaclav
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1817 - 1823
  • [5] Annotating Relations in Scientific Articles
    Meyers, Adam
    Lee, Giancarlo
    Grieve-Smith, Angus
    He, Yifan
    Taber, Harriet
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4601 - 4608
  • [6] Authorship Attribution of Arabic Articles
    Hajja, Maha
    Yahya, Ahmad
    Yahya, Adnan
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 194 - 208
  • [7] Authorship Attribution in Arabic Poetry
    Ahmed, Alfalahi
    Mohamed, Ramdani
    Mostafa, Bellafkih
    2016 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2016,
  • [8] Authorship Attribution of Arabic Tweets
    Rabab'ah, Abdullateef
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    Aldwairi, Monther
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [9] Attribution in German and Arabic: The Adjective
    Selmani, Lirim
    DEUTSCHE SPRACHE, 2019, 47 (03): : 239 - 257
  • [10] Annotating and Learning Morphological Segmentation of Egyptian Colloquial Arabic
    Mohamed, Emad
    Mohit, Behrang
    Oflazer, Kemal
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 873 - 877