Korean-Chinese Person Name Translation for Cross Language Information Retrieval

被引:0
|
作者
Wang, Yu-Chun [1 ,2 ]
Lee, Yi-Hsun [1 ]
Lin, Chu-Cheng [1 ,3 ]
Tsai, Richard Tzong-Han [4 ]
Hsu, Wen-Lian [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[3] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[4] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan, Taiwan
关键词
Person Name Translation; Korean-Chinese Cross Language Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entity translation plays an important role in many applications, such as information retrieval and machine translation. In this paper, we focus on translating person names, the most common type of name entity in Korean-Chinese cross language information retrieval (KCIR). Unlike other languages, Chinese uses characters (ideographs), which makes person name translation difficult because one syllable may map to several Chinese characters. We propose an effective hybrid person name translation method to improve the performance of KCIR. First, we use Wikipedia as a translation tool based on the inter-language links between the Korean edition and the Chinese or English editions. Second, we adopt the Naver people search engine to find the query name's Chinese or English translation. Third, we extract Korean-English transliteration pairs from Google snippets, and then search for the English-Chinese transliteration in the database of Taiwan's Central News Agency or in Google. The performance of KCIR using our method is over five times better than that of a dictionary-based system. The mean average precision is 0.3490 and the average recall is 0.7534. The method can deal with Chinese, Japanese, Korean, as well as non-CJK person name translation from Korean to Chinese. Hence, it substantially improves the performance of KCIR.
引用
收藏
页码:489 / +
页数:2
相关论文
共 50 条
  • [31] Phrasal translation and query expansion techniques for cross-language information retrieval
    Ballesteros, L
    Croft, WB
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1997, : 84 - 91
  • [32] Translation & transform algorithm of query sentence in cross-language information retrieval
    Zhang, Xiao-fei
    Zhang, Ke-liang
    Huang, He-yan
    PACLIC 20: PROCEEDINGS OF THE 20TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2006, : 467 - 470
  • [33] Query Translation Selection for Cross-Language Information Retrieval Based on HowNet
    Zhu, Honglei
    Zheng, Dequan
    Zhao, Tiejun
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 110 - 115
  • [34] Character N-grams translation in cross-language information retrieval
    Vilares, Jesus
    Oakes, Michael P.
    Vilares, Manuel
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4592 : 217 - +
  • [35] A comparative study of query and document translation for cross-language information retrieval
    Oard, DW
    MACHINE TRANSLATION AND THE INFORMATION SOUP, 1998, 1529 : 472 - 483
  • [36] Combining Query Translation Techniques to Improve Cross-Language Information Retrieval
    Herbert, Benjamin
    Szarvas, Gyoergy
    Gurevych, Iryna
    ADVANCES IN INFORMATION RETRIEVAL, 2011, 6611 : 712 - 715
  • [37] A progressive algorithm for cross-language information retrieval based on dictionary translation
    Yuan, Song An
    Yu, Song Nian
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 246 - 255
  • [38] Indonesian-English Transitive Translation for Cross-Language Information Retrieval
    Adriani, Mirna
    Hayurani, Herika
    Sari, Syandra
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 127 - 133
  • [39] Combining Lexical and Statistical Translation Evidence for Cross-Language Information Retrieval
    Kim, Sungho
    Ko, Youngjoong
    Oard, Douglas W.
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2015, 66 (01) : 23 - 39
  • [40] A hybrid approach of noun phrase translation in cross-language information retrieval
    Nguyen, Thanh C.
    Nguyen, Hieu V.
    Phan, Tuoi T.
    ASIAN DIGITAL LIBRARIES: LOOKING BACK 10 YEARS AND FORGING NEW FRONTIERS, PROCEEDINGS, 2007, 4822 : 502 - 503