Bilingual Lexicon Extraction from Comparable Corpora Based on Closed Concepts Mining

Cited by: 3
Authors
Chebel, Mohamed [1]
Latiri, Chiraz [1]
Gaussier, Eric [2]
Affiliations
[1] Univ Tunis El Manar, Fac Sci Tunis, Res Lab LIPAH, Tunis, Tunisia
[2] Univ Joseph Fourier, Res Lab LIG, Grenoble I, AMA Grp, Grenoble, France
DOI
10.1007/978-3-319-57454-7_46
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose to complement the context vectors used in bilingual lexicon extraction from comparable corpora with concept vectors, which aim to capture all the words related to the concepts associated with a given word. This yields a less sparse representation, especially in specialized domains where a general bilingual lexicon leaves many words untranslated. The concept vectors we consider are based on closed concepts mining, as developed in Formal Concept Analysis (FCA). Results on two different comparable corpora show that enriching context vectors with concept vectors leads to lexicons of higher quality, especially in specialized domains.
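To make the notion of a closed concept more concrete, here is a minimal Python sketch on a toy document-term context. It is an illustration under stated assumptions, not the authors' method: the toy documents, the function names (`extent`, `intent`, `closure`, `closed_intents`, `concept_vector`) and the counting scheme used to build a concept vector are all hypothetical, and the abstract does not say how concept vectors are weighted or combined with context vectors.

```python
from collections import Counter
from itertools import combinations

# Toy formal context (hypothetical data): document id -> set of terms.
DOCS = {
    "d1": frozenset({"heart", "cardiac", "surgery"}),
    "d2": frozenset({"heart", "cardiac", "valve"}),
    "d3": frozenset({"surgery", "valve"}),
}


def extent(docs, terms):
    """Documents that contain every term in `terms`."""
    return frozenset(d for d, ts in docs.items() if terms <= ts)


def intent(docs, doc_ids):
    """Terms shared by every document in `doc_ids`."""
    sets = [docs[d] for d in doc_ids]
    if not sets:
        # By FCA convention, the empty extent maps to the full term set.
        return frozenset().union(*docs.values())
    return frozenset.intersection(*sets)


def closure(docs, terms):
    """Closure operator: the smallest closed term set containing `terms`."""
    return intent(docs, extent(docs, terms))


def closed_intents(docs):
    """Closed term sets with a non-empty extent.

    Every such intent is an intersection of document term sets, so we
    start from the documents and close the family under intersection.
    This naive loop is only suitable for tiny examples.
    """
    intents = set(docs.values())
    changed = True
    while changed:
        changed = False
        for a, b in combinations(list(intents), 2):
            c = a & b
            if c not in intents:
                intents.add(c)
                changed = True
    return intents


def concept_vector(docs, word):
    """Assumed scheme: count, for each other term, the closed concepts
    in which it appears together with `word`."""
    vec = Counter()
    for terms in closed_intents(docs):
        if word in terms:
            vec.update(t for t in terms if t != word)
    return vec


if __name__ == "__main__":
    print(sorted(closure(DOCS, frozenset({"heart"}))))
    print(concept_vector(DOCS, "heart"))
```

In this toy context, "heart" and "cardiac" always occur together, so the closure of {"heart"} pulls in "cardiac" even if the two words never fall in the same context window. That is the intuition behind using concepts to obtain a less sparse representation.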
Pages: 586-598
Number of pages: 13
Related papers
50 in total
  • [1] Efficient bilingual lexicon extraction from comparable corpora based on formal concepts analysis
    Chebel, Mohamed
    Latiri, Chiraz
    Gaussier, Eric
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (01) : 138 - 161
  • [2] Addressing polysemy in bilingual lexicon extraction from comparable corpora
    Fiser, Darja
    Ljubesic, Nikola
    Kubelka, Ozren
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012 : 3031 - 3035
  • [3] Bilingual Lexicon Extraction with Forced Correlation from Comparable Corpora
    Zhang, Chunyue
    Zhao, Tiejun
    NEURAL INFORMATION PROCESSING, PT II, 2015, 9490 : 528 - 535
  • [4] Adaptive Dictionary for Bilingual Lexicon Extraction from Comparable Corpora
    Hazem, Amir
    Morin, Emmanuel
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012 : 288 - 292
  • [5] Looking at Unbalanced Specialized Comparable Corpora for Bilingual Lexicon Extraction
    Morin, Emmanuel
    Hazem, Amir
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014 : 1284 - 1293
  • [6] Exploiting unbalanced specialized comparable corpora for bilingual lexicon extraction
    Morin, Emmanuel
    Hazem, Amir
    NATURAL LANGUAGE ENGINEERING, 2016, 22 (04) : 575 - 601
  • [7] Iterative Bilingual Lexicon Extraction from Comparable Corpora with Topical and Contextual Knowledge
    Chu, Chenhui
    Nakazawa, Toshiaki
    Kurohashi, Sadao
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PART II, 2014, 8404 : 296 - 309
  • [8] Bilingual Lexicon Extraction with Temporal Distributed Word Representation from Comparable Corpora
    Zhang, Chunyue
    Zhao, Tiejun
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 380 - 387
  • [9] Bilingual Lexicon Extraction using Locally Weighted Linear Regression from Comparable Corpora
    Zhang, Chunyue
    Zhao, Tiejun
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015 : 13 - 16
  • [10] Combining Lexical Context with Pseudo-alignment for Bilingual Lexicon Extraction from Comparable Corpora
    Li, Bo
    Zhu, Qunyan
    He, Tingting
    Chen, Qianjun
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 223 - 233