Multi-word term indexing for Arabic document retrieval

Cited by: 0
Authors
Boulaknadel, Siham [1]
Daille, Beatrice [1]
Aboutajdine, Driss [2]
Affiliations
[1] Univ Nantes, CNRS, FRE 2729, LINA, 2 Rue de la Houssinière, BP 92208, F-44322 Nantes 03, France
[2] Mohammed V Univ, GSCM, Rabat, Morocco
DOI
Not available
Chinese Library Classification code
TP3 [Computing technology, computer technology]
Subject classification code
0812
Abstract
To improve the performance of information retrieval systems, it seems important to identify key phrases, which represent the semantic content of a text better than single-word terms. In this paper, we adapt the standard method for multi-word term extraction to the Arabic language. We define the linguistic specifications and develop a term extraction tool. We apply the term extraction program to document retrieval in a specific domain, evaluate two kinds of multi-word term weighting functions, based on either the corpus or the document, and show that multi-word term indexing is effective with both weightings, improving average precision by up to 5.8%.
Pages: 480 / +
Page count: 3