Korean-Chinese Person Name Translation for Cross Language Information Retrieval

被引:0
|
作者
Wang, Yu-Chun [1 ,2 ]
Lee, Yi-Hsun [1 ]
Lin, Chu-Cheng [1 ,3 ]
Tsai, Richard Tzong-Han [4 ]
Hsu, Wen-Lian [1 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[3] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[4] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan, Taiwan
关键词
Person Name Translation; Korean-Chinese Cross Language Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entity translation plays an important role in many applications, such as information retrieval and machine translation. In this paper, we focus on translating person names, the most common type of name entity in Korean-Chinese cross language information retrieval (KCIR). Unlike other languages, Chinese uses characters (ideographs), which makes person name translation difficult because one syllable may map to several Chinese characters. We propose an effective hybrid person name translation method to improve the performance of KCIR. First, we use Wikipedia as a translation tool based on the inter-language links between the Korean edition and the Chinese or English editions. Second, we adopt the Naver people search engine to find the query name's Chinese or English translation. Third, we extract Korean-English transliteration pairs from Google snippets, and then search for the English-Chinese transliteration in the database of Taiwan's Central News Agency or in Google. The performance of KCIR using our method is over five times better than that of a dictionary-based system. The mean average precision is 0.3490 and the average recall is 0.7534. The method can deal with Chinese, Japanese, Korean, as well as non-CJK person name translation from Korean to Chinese. Hence, it substantially improves the performance of KCIR.
引用
收藏
页码:489 / +
页数:2
相关论文
共 50 条
  • [41] Strategy to block interference from the source language (cognate signifiants) in Korean-Chinese interpretation
    Kim, Hye-Rim
    META, 2006, 51 (02) : 247 - 262
  • [42] Translation disambiguation for cross-language information retrieval using context-based translation probability
    Kishida, Kazuaki
    Ishita, Emi
    JOURNAL OF INFORMATION SCIENCE, 2009, 35 (04) : 481 - 495
  • [43] Foreign name backward transliteration in Chinese-English cross-language image retrieval
    Lin, WC
    Yang, CH
    Chen, HH
    COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS, 2003, 3237 : 611 - 620
  • [44] Research Trend Overview for Chinese-English Cross Language Information Retrieval
    Wang, Juan
    Lyu, Peng-hui
    Ma, Xue-jing
    INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE AND MANAGEMENT (ICSSM 2014), 2014, : 432 - 438
  • [45] Research on Cross Language Information Retrieval of Tibetan Chinese Based on Dynamic Dictionary
    Li, Ai-lin
    Yu, Hong-zhi
    Jiang, Tao
    COMPUTER SCIENCE AND TECHNOLOGY (CST2016), 2017, : 615 - 622
  • [46] A hybrid approach to query and document translation using a pivot language for cross-language information retrieval
    Kishida, Kazuaki
    Kando, Noriko
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 93 - 101
  • [47] Cross-Language Information Retrieval
    Federico, Marcello
    COMPUTATIONAL LINGUISTICS, 2011, 37 (02) : 411 - 412
  • [48] Cross-language information retrieval
    Oard, DW
    Diekema, AR
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1998, 33 : 223 - 256
  • [49] Cross-language information retrieval
    Nie J.-Y.
    Synthesis Lectures on Human Language Technologies, 2010, 3 (01): : 1 - 142
  • [50] Cross Language Information Retrieval System
    Yang, Zhidan
    Yang, Zhiting
    PROCEEDINGS OF THE 2ND INTERNATIONAL FORUM ON MANAGEMENT, EDUCATION AND INFORMATION TECHNOLOGY APPLICATION (IFMEITA 2017), 2017, 130 : 373 - 377