Combining Discourse Markers and Cross-lingual Embeddings for Synonym-Antonym Classification

Cited by: 0
Authors
Roth, Michael [1]
Upadhyay, Shyam [2]
Affiliations
[1] Univ Stuttgart, Inst Nat Language Proc, Stuttgart, Germany
[2] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
It is well known that distributional semantic approaches have difficulty distinguishing between synonyms and antonyms (Grefenstette, 1992; Pado and Lapata, 2003). Recent work has shown that supervision available in English for this task (e.g., lexical resources) can be transferred to other languages via cross-lingual word embeddings. However, this kind of transfer misses monolingual distributional information available in the target language, such as contrast relations that are indicative of antonymy (e.g., hot ... while ... cold). In this work, we improve the transfer by exploiting monolingual information, expressed in the form of co-occurrences with discourse markers that convey contrast. Our approach makes use of fewer than a dozen markers, which can easily be obtained for many languages. Compared to a baseline using only cross-lingual embeddings, we show absolute improvements of 4-10% F1-score in Vietnamese and Hindi.
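As a rough illustration of the co-occurrence idea described in the abstract, the following Python sketch counts how often two words appear on opposite sides of a contrast marker within a sentence. The marker list, the function name marker_cooccurrence_counts, and the toy corpus are assumptions made for illustration; they are not taken from the paper, whose actual features and classifier are not described in this record.

```python
from collections import Counter

# Hypothetical English contrast markers (an assumption; the paper's own
# marker list is not given in this record).
CONTRAST_MARKERS = {"but", "while", "whereas", "although", "however"}


def marker_cooccurrence_counts(sentences, markers=CONTRAST_MARKERS):
    """Count unordered word pairs found on opposite sides of a contrast marker."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, token in enumerate(tokens):
            if token not in markers:
                continue
            # Words before and after the marker, excluding markers themselves.
            left = {t for t in tokens[:i] if t not in markers}
            right = {t for t in tokens[i + 1:] if t not in markers}
            for a in left:
                for b in right:
                    if a != b:
                        counts[frozenset((a, b))] += 1
    return counts


# Toy corpus, invented for illustration only.
corpus = [
    "the soup is hot while the salad is cold",
    "the tea was hot but the room was cold",
    "the soup is hot and the tea is warm",
]
counts = marker_cooccurrence_counts(corpus)
hot_cold = counts[frozenset(("hot", "cold"))]
hot_warm = counts[frozenset(("hot", "warm"))]
```

In this toy corpus the pair (hot, cold) is seen across a contrast marker while (hot, warm) is not. A count of this kind could in principle be supplied to a classifier alongside cross-lingual embedding features, though how the paper combines the two signals is not stated here.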
Pages: 3899-3905
Page count: 7