CrossAligner & Co: Zero-Shot Transfer Methods for Task-Oriented Cross-lingual Natural Language Understanding

被引:0
|
作者
Gritta, Milan [1 ]
Hu, Ruoyu [1 ,2 ]
Iacobacci, Ignacio [1 ]
机构
[1] Huawei Noahs Ark Lab, London, England
[2] Imperial Coll London, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Task-oriented personal assistants enable people to interact with a host of devices and services using natural language. One of the challenges of making neural dialogue systems available to more users is the lack of training data for all but a few languages. Zero-shot methods try to solve this issue by acquiring task knowledge in a high-resource language such as English with the aim of transferring it to the low-resource language(s). To this end, we introduce CrossAligner, the principal method of a variety of effective approaches for zero-shot cross-lingual transfer based on learning alignment from unlabelled parallel data. We present a quantitative analysis of individual methods as well as their weighted combinations, several of which exceed state-of-the-art (SOTA) scores as evaluated across nine languages, fifteen test sets and three benchmark multilingual datasets. A detailed qualitative error analysis of the best methods shows that our fine-tuned language models can zero-shot transfer the task knowledge better than anticipated.
引用
收藏
页码:4048 / 4061
页数:14
相关论文
共 50 条
  • [41] Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing
    Wang, Yuxuan
    Che, Wanxiang
    Guo, Jiang
    Liu, Yijia
    Liu, Ting
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5721 - 5727
  • [42] Zero-Shot Learning for Cross-Lingual News Sentiment Classification
    Pelicon, Andraz
    Pranjic, Marko
    Miljkovic, Dragana
    Skrlj, Blaz
    Pollak, Senja
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [43] Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
    Huang, Po-Yao
    Patrick, Mandela
    Hu, Junjie
    Neubig, Graham
    Metze, Florian
    Hauptmann, Alexander
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2443 - 2459
  • [44] Zero-Shot Cross-Lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders
    Chen, Guanhua
    Ma, Shuming
    Chen, Yun
    Dong, Li
    Zhang, Dongdong
    Pan, Jia
    Wang, Wenping
    Wei, Furu
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 15 - 26
  • [45] Soft Layer Selection with Meta-Learning for Zero-Shot Cross-Lingual Transfer
    Xu, Weijia
    Haider, Batool
    Krone, Jason
    Mansour, Saab
    1ST WORKSHOP ON META LEARNING AND ITS APPLICATIONS TO NATURAL LANGUAGE PROCESSING (METANLP 2021), 2021, : 11 - 18
  • [46] Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation
    Chen, Guanhua
    Ma, Shuming
    Chen, Yun
    Zhang, Dongdong
    Pan, Jia
    Wang, Wenping
    Wei, Furu
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 142 - 157
  • [47] BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer
    Parovic, Marinela
    Glavas, Goran
    Vulic, Ivan
    Korhonen, Anna
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1791 - 1799
  • [48] ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
    Mekala, Dheeraj
    Wolfe, Jason
    Roy, Subhro
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 5792 - 5799
  • [49] Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages
    Adeyemi, Mofetoluwa
    Oladipo, Akintunde
    Pradeep, Ronak
    Lin, Jimmy
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2: SHORT PAPERS, 2024, : 650 - 656
  • [50] Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing
    Shi, Freda
    Gimpel, Kevin
    Livescu, Karen
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6547 - 6563