Annotating Attribution Relations in Arabic

被引:0
|
作者
Alsaif, Amal [1 ]
Alyahya, Tasniem [1 ]
Alotaibi, Madawi [1 ]
Almuzaini, Huda [1 ]
Algahtani, Abeer [1 ]
机构
[1] Al Imam Mohammad Ibn Saud Islamic Univ, Coll Comp Sci & Informat, Riyadh, Saudi Arabia
关键词
attribution; annotation tool; NLP; Arabic discourse; annotation guidelines; ATB; inter-annotator agreement; CORPUS;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present a first empirical effort in annotating attribution in Modern Standard Arabic (MSA). Identifying attributed arguments to the source is applied successfully in diverse systems such as authorship identification, information retrieval, and opinion mining. Current studies focus on using lexical terms in long texts to verify, for example, the author identity. While attribution identification in short texts is still unexplored completely due to the lack of resources such as annotated corpora and tools especially in Arabic on one hand, and the limited coverage of different attribution usages in Arabic literature, on other hand. The paper presents our guidelines for annotating attribution elements: cue, source, and the content with required syntactical and semantic features in Arabic news (Arabic TreeBank ATB) insight of earlier studies for other languages with all required adaptation. We also develop a new annotation tool for attribution in Arabic to ensure that all instances of attribution are reliably annotated. The results of a pilot annotation are discussed in addition to the inter-annotators agreement studies towards creating the first gold standard attribution corpus for Arabic.
引用
收藏
页码:4008 / 4015
页数:8
相关论文
共 50 条
  • [21] Annotating Objects and Relations in User-Generated Videos
    Shang, Xindi
    Di, Donglin
    Xiao, Junbin
    Cao, Yu
    Yang, Xun
    Chua, Tat-Seng
    ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 279 - 287
  • [22] Annotating Temporal Relations to Determine the Onset of Psychosis Symptoms
    Viani, Natalia
    Kam, Joyce
    Yin, Lucia
    Verma, Somain
    Stewart, Robert
    Patel, Rashmi
    Velupillai, Sumithra
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 418 - 422
  • [23] Annotating Qualia Relations in Italian and French Complex Nominals
    Bouillon, Pierrette
    Jezek, Elisabetta
    Melloni, Chiara
    Picton, Aurelie
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1527 - 1532
  • [24] Toward a discourse theory for annotating causal relations in Japanese
    Kaneko, Kimi
    Bekki, Daisuke
    Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, PACLIC 2014, 2014, : 460 - 469
  • [25] IMPACT OF GENRE ON AUTHORSHIP ATTRIBUTION IN ARABIC POETRY AND PROSE
    Mohamed, Emad
    Elewa, Abdelhamid
    INTERNATIONAL JOURNAL OF HUMANITIES AND ARTS COMPUTING-A JOURNAL OF DIGITAL HUMANITIES, 2025, 19 (01): : 65 - 84
  • [26] Arabic Authorship Attribution: An Extensive Study on Twitter Posts
    Altakrori, Malik H.
    Iqbal, Farkhund
    Fung, Benjamin C. M.
    Ding, Steven H. H.
    Tubaishat, Abdallah
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (01)
  • [27] Naive Bayes classifiers for authorship attribution of Arabic texts
    Altheneyan, Alaa Saleh
    Menai, Mohamed El Bachir
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) : 473 - 484
  • [28] A Comparative Survey of Authorship Attribution on Short Arabic Texts
    Ouamour, Siham
    Sayoud, Halim
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 479 - 489
  • [29] Annotating Verbal Multiword Expressions in Arabic: Assessing the Validity of a Multilingual Annotation Procedure
    Mohamed, Najet Hadj
    Ben Khelil, Cherifa
    Savary, Agata
    Keskes, Iskandar
    Antoine, Jean-Yves
    Hadrich, Lamia Belguith
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1839 - 1848
  • [30] An Attribution Relations Corpus for Political News
    Newell, Edward
    Margolin, Drew
    Ruths, Derek
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3315 - 3322