Multi-word term indexing for Arabic document retrieval

被引:0
|
作者
Boulaknadel, Siham [1 ]
Daille, Beatrice [1 ]
Driss, Aboutajdine [2 ]
机构
[1] Univ Nantes, CNRS, FRE 2729, LINA, 2 Rue Houssinire,BP 92208, F-44322 Nantes 03, France
[2] Mohammed V Univ, GSCM, Rabat, Morocco
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To improve information retrieval system performances, it seems important to identify key phrases which constitute a better representation of text semantic content than single word terms. In this paper, we adapt the standard method for multi-word term extraction for Arabic language. We define the linguistic specifications and develop a term extraction tool. We experiment the term extraction program for document retrieval in a specific domain, evaluate two kinds of multi-word term weighting functions considering either the corpus or the document, and demonstrate the efficiency of multi-word term indexing for both weighting up to 5.8% of average precision.
引用
收藏
页码:480 / +
页数:3
相关论文
共 50 条
  • [21] Comparison of word and subword indexing techniques for Mandarin Chinese spoken document retrieval
    Wang, HM
    Chen, BL
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 606 - 613
  • [22] Web document indexing and retrieval
    Hyusein, B
    Patel, A
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 573 - 579
  • [23] A Hybrid Model for Arabic Document Indexing
    Ben Guirat, Souheila
    Bounhas, Ibrahim
    Slimani, Yahya
    2016 17TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2016, : 109 - 114
  • [24] Acronyms as an Integral Part of Multi-Word Term Recognition - A Token of Appreciation
    Spasic, Irena
    IEEE ACCESS, 2018, 6 : 8351 - 8363
  • [25] Term Extraction For A Single & Multi-Word Based On Islamic Corpus English
    Abduljabbar, Waleed Khalid
    Tomah, Saadiyaa A.
    Ali, Ammar Abdulateef
    2018 1ST ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION AND SCIENCES (AICIS 2018), 2018, : 107 - 111
  • [26] The Oil Field Multi-word Term Recognition Based on Hybrid Strategy
    Liang, Ying-hong
    Liang, Ying-hong
    Li, Jin-xiang
    Xian, Xue-feng
    Chen, Ke
    2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (ICCSAI 2013), 2013, : 395 - 398
  • [27] Rule-based Automatic Multi-Word Term Extraction and Lemmatization
    Stankovic, Ranka
    Krstev, Cvetana
    Obradovic, Ivan
    Lazic, Biljana
    Trtovac, Aleksandra
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 507 - 514
  • [28] Multi-word Term Translation: A Student-Centered Pilot Study
    Bullon, Sandra
    Leon-Arauz, Pilar
    COMPUTATIONAL AND CORPUS-BASED PHRASEOLOGY, 2022, 13528 : 47 - 61
  • [29] On the Creation of a Corpus-Derived Medical Multi-Word Term List
    Florescu, Cosmin Mihail
    Ohniwa, Ryosuke L.
    INFORMATION, 2025, 16 (02)
  • [30] A Contrastive Approach to Multi-word Term Extraction from Domain Corpora
    Bonin, Francesca
    Dell'Orletta, Felice
    Venturi, Giulia
    Montemagni, Simonetta
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,