Combining Discourse Markers and Cross-lingual Embeddings for Synonym-Antonym Classification

被引:0
|
作者
Roth, Michael [1 ]
Upadhyay, Shyam [2 ]
机构
[1] Univ Stuttgart, Inst Nat Language Proc, Stuttgart, Germany
[2] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well-known that distributional semantic approaches have difficulty in distinguishing between synonyms and antonyms (Grefenstette, 1992; Pado and Lapata, 2003). Recent work has shown that supervision available in English for this task (e.g., lexical resources) can be transferred to other languages via cross-lingual word embeddings. However, this kind of transfer misses monolingual distributional information available in a target language, such as contrast relations that are indicative of antonymy (e.g., hot...while...cold). In this work, we improve the transfer by exploiting monolingual information, expressed in the form of co-occurrences with discourse markers that convey contrast. Our approach makes use of less than a dozen markers, which can easily be obtained for many languages. Compared to a baseline using only cross-lingual embeddings, we show absolute improvements of 410% F-1-score in Vietnamese and Hindi.
引用
收藏
页码:3899 / 3905
页数:7
相关论文
共 50 条
  • [41] A Comparative Study of Cross-Lingual Sentiment Classification
    Wan, Xiaojun
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 24 - 31
  • [42] Cross-lingual sentiment classification with stacked autoencoders
    Guangyou Zhou
    Zhiyuan Zhu
    Tingting He
    Xiaohua Tony Hu
    Knowledge and Information Systems, 2016, 47 : 27 - 44
  • [43] Czech Dataset for Cross-lingual Subjectivity Classification
    Priban, Pavel
    Steinberger, Josef
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1381 - 1391
  • [44] Cross-lingual sentiment classification with stacked autoencoders
    Zhou, Guangyou
    Zhu, Zhiyuan
    He, Tingting
    Hu, Xiaohua Tony
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 47 (01) : 27 - 44
  • [45] A cross-lingual video classification using subtitles
    El Kah, Anoual
    Zeroual, Imad
    2022 2ND INTERNATIONAL CONFERENCE ON INNOVATIVE RESEARCH IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY (IRASET'2022), 2022, : 703 - 707
  • [46] Revisiting Machine Translation for Cross-lingual Classification
    Artetxe, Mikel
    Goswami, Vedanuj
    Bhosale, Shruti
    Fan, Angela
    Zettlemoyer, Luke
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 6489 - 6499
  • [47] Active Learning for Cross-Lingual Sentiment Classification
    Li, Shoushan
    Wang, Rong
    Liu, Huanhuan
    Huang, Chu-Ren
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 236 - 246
  • [48] English-Malay Word Embeddings Alignment for Cross-lingual Emotion Classification with Hierarchical Attention Network
    Hao, Lim Ying
    Yan, Jasy Liew Suet
    PROCEEDINGS OF THE 12TH WORKSHOP ON COMPUTATIONAL APPROACHES TO SUBJECTIVITY, SENTIMENT & SOCIAL MEDIA ANALYSIS, 2022, : 113 - 124
  • [49] Cross-lingual and cross-domain discourse segmentation of entire documents
    Braud, Chloe
    Lacroix, Ophelie
    Sogaard, Anders
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 237 - 243
  • [50] Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
    Vulic, Ivan
    Glavas, Goran
    Reichart, Roi
    Korhonen, Anna
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4407 - 4418