Automatic extraction of translation patterns from bilingual legal corpus

Cited by: 0
Authors
Ohara, M [1]
Matsubara, S [1]
Inagaki, Y [1]
Affiliations
[1] Nagoya Univ, Grad Sch Engn, Chikusa Ku, Nagoya, Aichi 4648603, Japan
Keywords
translation pattern; legal document; bilingual corpus; dictionary; statistical alignment; dependency structure
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multilingualization of legal documents is desirable for promoting the internationalization of society. Because legal documents contain technical terms and characteristic expression patterns, choosing the proper terms is vital when translating them, so it is desirable to compile bilingual dictionaries for each legal domain. Compiling even basic bilingual dictionaries for legal documents is difficult, however, because of the wide range of such documents. This paper describes a method for automatically extracting translation patterns for legal document translation from legal documents and their translations. The proposed method extracts translation patterns in Japanese bunsetsu-level units from pairs of legal sentences and translated sentences that have been properly aligned with each other. It uses three indicators for pattern extraction: bilingual dictionaries, statistical co-occurrence information from the parallel corpus, and syntactic information based on dependency grammar. We extracted translation patterns from the Japanese Civil Code and its translation. The extraction achieved 80.5% precision and 49.1% recall, and the extracted patterns should be useful for translating legal documents and for helping to construct a Japanese-English legal dictionary.
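As a rough illustration of the statistical co-occurrence indicator and of how precision and recall are computed, the following minimal sketch scores candidate unit pairs over sentence-aligned data. It is not the authors' implementation: the abstract does not name the co-occurrence measure, so the Dice coefficient is an assumption, and the function names, the 0.9 threshold, and the romanized toy units are all hypothetical.

```python
from collections import Counter
from itertools import product


def dice_scores(aligned_pairs):
    """Score each (source unit, target unit) pair by the Dice coefficient
    of its sentence-level co-occurrence counts (an assumed measure)."""
    src_freq, tgt_freq, pair_freq = Counter(), Counter(), Counter()
    for src_units, tgt_units in aligned_pairs:
        src_set, tgt_set = set(src_units), set(tgt_units)
        src_freq.update(src_set)
        tgt_freq.update(tgt_set)
        pair_freq.update(product(src_set, tgt_set))
    return {
        (s, t): 2 * n / (src_freq[s] + tgt_freq[t])
        for (s, t), n in pair_freq.items()
    }


def precision_recall(extracted, gold):
    """Precision and recall of extracted patterns against a gold set."""
    extracted, gold = set(extracted), set(gold)
    correct = len(extracted & gold)
    precision = correct / len(extracted) if extracted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


# Invented toy data: each item pairs the units of a source sentence with
# the units of its aligned translation.
corpus = [
    (["saikensha wa", "seikyu suru"], ["the obligee", "may demand"]),
    (["saikensha wa", "kaijo suru"], ["the obligee", "may cancel"]),
    (["saimusha wa", "seikyu suru"], ["the obligor", "may demand"]),
]
scores = dice_scores(corpus)

# Keep only strongly associated pairs (the threshold is a hypothetical choice).
extracted = {pair for pair, score in scores.items() if score >= 0.9}

# Hypothetical gold standard containing one pattern the toy corpus never shows.
gold = extracted | {("baisho suru", "shall compensate")}
precision, recall = precision_recall(extracted, gold)
```

In the method the abstract describes, a score of this kind would be only one of three signals, alongside bilingual dictionary lookups and dependency-structure information.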
Pages: 150-157
Page count: 8
Related papers
  • [1] Automatic extraction of translation patterns from bilingual legal corpus
    Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
    NLP-KE - Int. Conf. Nat. Lang. Process. Knowl. Eng., Proc., 1600, (150-157):
  • [2] Automatic Extraction of Super-Function From Bilingual Corpus
    Sasayama, Manabu
    Ren, Fuji
    Kuroiwa, Shingo
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2009, 225 : 329 - 340
  • [3] Automatic Extraction of Super-Function From Bilingual Corpus
    Sasayama, Manabu
    Ren, Fuji
    Kuroiwa, Shingo
    Electronic Notes in Theoretical Computer Science, 2009, 225 (0C): 329 - 340
  • [4] Bilingual Terminology Extraction based on Translation Patterns
    Simoes, Alberto
    Almeida, Jose Joao
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): 281 - 288
  • [5] Automatic discovery of translation collocations from bilingual corpora
    Barrachina, S
    Vilar, JM
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 571 - 575
  • [6] Learning translation rules from bilingual English-Filipino corpus
    Tan, Michelle Wendy
    Ang, Raymond Joseph
    Bautista, Natasja Gail
    Cai, Ya Rong
    Tanlo, Bianca
    PACLIC 19: THE 19TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2005: 211 - 219
  • [7] Extraction of collocations and their equivalents of translation from parallel corpus
    Seretan, Violeta
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2009, 50 (01): 305 - 332
  • [8] Improving automatic Chinese-Japanese patent translation using bilingual term extraction
    Yang, Wei
    Lepage, Yves
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2018, 13 (01) : 117 - 125
  • [9] Automatic building of a machine translation bilingual dictionary using recursive chain-link-type learning from a parallel corpus
    Echizen-ya, H
    Araki, K
    Momouchi, Y
    Tochinai, K
    MICAI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 2972 : 410 - 419
  • [10] Automatic extraction of aspectual information from a monolingual corpus
    Oishi, A
    Matsumoto, Y
    35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997: 352 - 359