A class-based approach to word alignment

Cited by: 0
Authors
Ker, SJ
Chang, JS
DOI
Not available
Chinese Library Classification
TP18 [Theory of artificial intelligence];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents an algorithm for identifying the translation of each word in a bilingual corpus. Previously proposed methods rely heavily on word-based statistics. Under a word-based approach, frequent words with a consistent translation can be aligned with high precision. However, words that are less frequent or exhibit diverse translations generally do not have statistically significant evidence for confident alignment, which leads to incomplete or incorrect alignments. The algorithm proposed herein attempts to broaden coverage by exploiting lexicographic resources. To this end, we draw on the two word classification systems in the Longman Lexicon of Contemporary English (LLOCE) and Tongyici Cilin (Synonym Forest, CILIN). Automatically acquired class-based alignment rules are used to compensate for what is lacking in a bilingual dictionary such as the English-Chinese version of the Longman Dictionary of Contemporary English (LecDOCE). The alignment method is trained and tested on LecDOCE examples and their translations, while further examples from a technical manual in both English and Chinese are used for an open test. Quantitative results of the closed and open tests are also summarized.
Pages: 313-343
Page count: 31
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02): : 568 - 578