An Experimental Comparison of Explicit Semantic Analysis Implementations for Cross-Language Retrieval

被引:0
|
作者
Sorg, Philipp [1 ]
Cimiano, Philipp [2 ]
机构
[1] Univ Karlsruhe, Inst AIFB, Karlsruhe, Germany
[2] Delft Univ Technol, Web Informat Syst Grp, Delft, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explicit Semantic Analysis (ESA) has been recently proposed as an approach to computing semantic relatedness between words (and indirectly also between texts) and has thus a natural application in information retrieval, showing the potential to alleviate the vocabulary mismatch problem inherent in standard Bag-of-Word models. The ESA model has been also recently extended to cross-lingual retrieval settings, which can be considered as an extreme case of the vocabulary mismatch problem. The ESA approach actually represents a class of approaches and allows for various instantiations. As our first contribution, we generalize ESA in order to clearly show the degrees of freedom it provides. Second, we propose some variants of ESA along different dimensions, testing their impact on performance on a cross-lingual mate retrieval task on two datasets (JRC-ACQUIS and Multext). Our results are interesting as a systematic investigation has been missing so far and the variations between different basic design choices are significant. We also show that the settings adopted in the original ESA implementation are reasonably good, which to our knowledge has not been demonstrated so far, but can still be significantly improved by tuning the right parameters (yielding a relative improvement on a cross-lingual mate retrieval task of between 62% (Multext) and 237% (JRC-ACQUIS) with respect to the original ESA model).
引用
收藏
页码:36 / +
页数:3
相关论文
共 50 条
  • [41] Language translation and media transformation in cross-language image retrieval
    Chen, Hsin-Hsi
    Chang, Yih-Chen
    DIGITAL LIBRARIES: ACHIEVEMENTS, CHALLENGES AND OPPORTUNITIES, PROCEEDINGS, 2006, 4312 : 350 - +
  • [42] Cross-language information retrieval using latent semantic indexing and self-organizing maps
    Ampazis, N
    Iakovaki, H
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 751 - 755
  • [43] Toward cross-language and cross-media image retrieval
    Alvarez, C
    Oumohmed, AI
    Mignotte, M
    Nie, JY
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 676 - 687
  • [44] Comparative Analysis of Information Retrieval Models on Quran Dataset in Cross-Language Information Retrieval Systems
    Taan, Ayman A.
    Khan, Shafiq Ur Rehman
    Raza, Ali
    Hanif, Ayaz Muhammad
    Anwar, Hira
    IEEE ACCESS, 2021, 9 : 169056 - 169067
  • [45] Word sense disambiguation for cross-language information retrieval
    Liu, MX
    Diamond, T
    Diekema, AR
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : B35 - B40
  • [46] Cross-language Information Retrieval Based on Multiple Information
    Liu, Pengyuan
    Zheng, Zhijun
    Su, Qi
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 623 - 626
  • [47] Online Learning to Rank for Cross-Language Information Retrieval
    Rahimi, Razieh
    Shakery, Azadeh
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1033 - 1036
  • [48] Comparative evaluation of cross-language information retrieval systems
    Peters, Carol
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2005, 3379 LNCS : 152 - 161
  • [49] SEMCL: A Cross-Language Semantic Model for Knowledge Sharing
    Guo, Weisen
    Kraines, Steven B.
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2010, 1 (03) : 1 - 19
  • [50] Cross-language semantic influences in different script bilinguals
    Degani, Tamar
    Prior, Anat
    Hajajra, Walaa
    BILINGUALISM-LANGUAGE AND COGNITION, 2018, 21 (04) : 782 - 804