Iterative Bilingual Lexicon Extraction from Comparable Corpora with Topical and Contextual Knowledge

被引:0
|
作者
Chu, Chenhui [1 ]
Nakazawa, Toshiaki [1 ]
Kurohashi, Sadao [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the literature, two main categories of methods have been proposed for bilingual lexicon extraction from comparable corpora, namely topic model and context based methods. In this paper, we present a bilingual lexicon extraction systemthat is based on a novel combination of these two methods in an iterative process. Our system does not rely on any prior knowledge and the performance can be iteratively improved. To the best of our knowledge, this is the first study that iteratively exploits both topical and contextual knowledge for bilingual lexicon extraction. Experiments conduct on Chinese-English and Japanese-English Wikipedia data show that our proposed method performs significantly better than a state-of-the-art method that only uses topical knowledge.
引用
收藏
页码:296 / 309
页数:14
相关论文
共 50 条
  • [1] Addressing polysemy in bilingual lexicon extraction from comparable corpora
    Fiser, Darja
    Ljubesic, Nikola
    Kubelka, Ozren
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3031 - 3035
  • [2] Bilingual Lexicon Extraction with Forced Correlation from Comparable Corpora
    Zhang, Chunyue
    Zhao, Tiejun
    NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 528 - 535
  • [3] Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
    Hazem, Amir
    Morin, Emmanuel
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 288 - 292
  • [4] Looking at Unbalanced Specialized Comparable Corpora for Bilingual Lexicon Extraction
    Morin, Emmanuel
    Hazem, Amir
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1284 - 1293
  • [5] Exploiting unbalanced specialized comparable corpora for bilingual lexicon extraction
    Morin, Emmanuel
    Hazem, Amir
    NATURAL LANGUAGE ENGINEERING, 2016, 22 (04) : 575 - 601
  • [6] Bilingual Lexicon Extraction from Comparable Corpora Based on Closed Concepts Mining
    Chebel, Mohamed
    Latiri, Chiraz
    Gaussier, Eric
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I, 2017, 10234 : 586 - 598
  • [7] Bilingual Lexicon Extraction with Temporal Distributed Word Representation from Comparable Corpora
    Zhang, Chunyue
    Zhao, Tiejun
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 380 - 387
  • [8] Bilingual Lexicon Extraction using Locally Weighted Linear Regression from Comparable Corpora
    Zhang, Chunyue
    Zhao, Tiejun
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 13 - 16
  • [9] Efficient bilingual lexicon extraction from comparable corpora based on formal concepts analysis
    Chebel, Mohamed
    Latiri, Chiraz
    Gaussier, Eric
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (01) : 138 - 161
  • [10] Combining Lexical Context with Pseudo-alignment for Bilingual Lexicon Extraction from Comparable Corpora
    Li, Bo
    Zhu, Qunyan
    He, Tingting
    Chen, Qianjun
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 223 - 233