Research of English-Chinese alignment at word granularity on parallel corpora

Cited by: 0
Authors
Xu Yang [1]
Wang Hou-feng [1]
Lue Xue-qiang [2]
Affiliations
[1] Peking Univ, Beijing 100871, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Beijing 100101, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
10.1109/ICIS.2008.28
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bilingual alignment is a crucial problem in natural language processing, and word alignment is a particularly difficult point among all alignment granularities. This paper describes an English-Chinese word alignment model for bilingual corpora, based on a bilingual lexicon and some linguistic knowledge. The model is built on the theory of formal optimal partition of bilingual sentence pairs and is applicable to sentence pairs in any natural languages. In particular, by introducing several definitions and proving a theorem, we obtain alignment strategies that are independent of alignment direction. The model handles partial-match cases, resolves the problem of words that appear multiple times, and compensates for gaps in the bilingual lexicon. The experimental results show that the model aligns bilingual corpora at the word level effectively and with high accuracy, while preserving the grammatical structure of the original sentences.
Pages: 223 / +
Page count: 2