Context-based generic cross-lingual retrieval of documents and automated summaries

Cited by: 4
Authors
Lam, W [1]
Chan, K
Radev, D
Saggion, H
Teufel, S
Affiliations
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China
[2] Univ Michigan, Sch Informat, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept EECS, Ann Arbor, MI 48109 USA
[4] Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
[5] Univ Cambridge, Comp Lab, Cambridge CB3 0FD, England
DOI
10.1002/asi.20104
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
We develop a context-based generic cross-lingual retrieval model that can deal with different language pairs. Our model considers contexts in the query translation process. Contexts in the query as well as in the documents are exploited, based on co-occurrence statistics from passages of different granularity. We also investigate cross-lingual retrieval of automatic generic summaries. We have implemented our model for two different cross-lingual settings, namely, retrieving Chinese documents from English queries and retrieving English documents from Chinese queries. Extensive experiments have been conducted on a large-scale parallel corpus, enabling studies of retrieval performance in both cross-lingual settings for full-length documents as well as automated summaries.
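The abstract does not give the model's formulas, so the following is only a minimal sketch of the general idea of using co-occurrence statistics to choose among candidate query translations. It is not the authors' method. The function names, the toy bilingual dictionary, and the toy passages are all hypothetical, and the sketch uses a single passage granularity where the paper considers several.

```python
from collections import Counter
from itertools import combinations, product


def cooccurrence_counts(passages):
    """Count how many passages each unordered pair of terms shares."""
    counts = Counter()
    for passage in passages:
        for a, b in combinations(sorted(set(passage)), 2):
            counts[(a, b)] += 1
    return counts


def pair_score(counts, a, b):
    """Look up the co-occurrence count of two terms, in either order."""
    return counts[tuple(sorted((a, b)))]


def translate_query(query_terms, candidates, counts):
    """Pick one candidate per query term so that the chosen
    translations co-occur with each other as often as possible."""
    options = [candidates[term] for term in query_terms]
    best, best_score = None, -1
    for combo in product(*options):
        score = sum(pair_score(counts, a, b)
                    for a, b in combinations(combo, 2))
        if score > best_score:
            best, best_score = list(combo), score
    return best


# Hypothetical toy data: tokenized target-language (Chinese) passages
# and a tiny English-to-Chinese dictionary with ambiguous entries.
passages = [
    ["银行", "利息", "贷款"],
    ["河岸", "树"],
    ["兴趣", "音乐"],
    ["银行", "利息"],
]
candidates = {
    "bank": ["银行", "河岸"],
    "interest": ["利息", "兴趣"],
}

counts = cooccurrence_counts(passages)
result = translate_query(["bank", "interest"], candidates, counts)
print(result)
```

In this toy setting the other query term acts as the context: the financial senses of "bank" and "interest" are selected because only those translations appear together in the target-language passages. The exhaustive search over combinations is practical only for short queries.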
Pages: 129-139
Page count: 11
Related papers
50 in total
  • [31] CrossMath: Towards Cross-lingual Math Information Retrieval
    Gore, James
    Polletta, Joseph
    Mansouri, Behrooz
    PROCEEDINGS OF THE 2024 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2024, 2024, : 101 - 105
  • [32] Cross-lingual and cross-domain discourse segmentation of entire documents
    Braud, Chloe
    Lacroix, Ophelie
    Sogaard, Anders
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 237 - 243
  • [33] A method of cross-lingual consumer health information retrieval
    Neveol, Aurelie
    Pereira, Suzanne
    Soualmia, Lina F.
    Thirion, Benoit
    Darmoni, Stefan J.
    UBIQUITY: TECHNOLOGIES FOR BETTER HEALTH IN AGING SOCIETIES, 2006, 124 : 601 - 608
  • [34] Effective translation, tokenization and combination for cross-lingual retrieval
    Kamps, J
    Adafre, SF
    de Rijke, M
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 123 - 134
  • [35] Exploiting Wikipedia for cross-lingual and multilingual information retrieval
    Sorg, P.
    Cimiano, P.
    DATA & KNOWLEDGE ENGINEERING, 2012, 74 : 26 - 45
  • [36] Cross-Lingual Information Retrieval System for Indian Languages
    Jagarlamudi, Jagadeesh
    Kumaran, A.
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 80 - 87
  • [37] CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer
    Wang, Yabing
    Wang, Fan
    Dong, Jianfeng
    Luo, Hao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5651 - 5659
  • [38] Ontology-based Tamil–English cross-lingual information retrieval system
    D Thenmozhi
    Chandrabose Aravindan
    Sādhanā, 2018, 43
  • [39] Monolingual and Cross-Lingual Information Retrieval Models Based on (Bilingual) Word Embeddings
    Vulic, Ivan
    Moens, Marie-Francine
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 363 - 372
  • [40] Fuzzy conceptual indexing for concept-based cross-lingual text retrieval
    Chau, R
    Yeh, CH
    IEEE INTERNET COMPUTING, 2004, 8 (05) : 14 - 21