Incorporating Linguistic Information to Statistical Word-Level Alignment

Authors
Cendejas, Eduardo [1]
Barcelo, Grettel [1]
Gelbukh, Alexander [1]
Sidorov, Grigori [1]
Affiliation
[1] Natl Polytech Inst, Ctr Res Comp, Mexico City, DF, Mexico
Keywords
Parallel texts; word alignment; linguistic information; dictionary; cognates; semantic domains; morphological information
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Alignment algorithms enrich parallel texts by establishing a relationship between the structures of the languages involved. Depending on the alignment level, this enrichment can be performed on the paragraphs, sentences, or words of the source-language content and its translation. There are two main approaches to word-level alignment: statistical and linguistic. Because languages have dissimilar grammar rules, statistical algorithms usually yield lower precision. For that reason, such algorithms are generally developed for a specific language pair using linguistic techniques. This paper presents a hybrid alignment system that combines the two traditional approaches. It provides user-friendly configuration and is adaptable to the computational environment. The system uses linguistic resources and procedures such as identification of cognates, morphological information, syntactic trees, dictionaries, and semantic domains. We show that the system outperforms existing algorithms.
Pages: 387-394
Page count: 8