UDapter: Typology-based Language Adapters for Multilingual Dependency Parsing and Sequence Labeling

被引:6
|
作者
Ustun, Ahmet [1 ]
Bisazza, Arianna [1 ]
Bouma, Gosse [1 ]
van Noord, Gertjan [1 ]
机构
[1] Univ Groningen, Ctr Language & Cognit, Groningen, Netherlands
关键词
Computational linguistics - Natural language processing systems - Zero-shot learning;
D O I
10.1162/coli_a_00443
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in multilingual language modeling have brought the idea of a truly universal parser closer to reality. However, such models are still not immune to the "curse of multilinguality": Cross-language interference and restrained model capacity remain major obstacles. To address this, we propose a novel language adaptation approach by introducing contextual language adapters to a multilingual parser. Contextual language adapters make it possible to learn adapters via language embeddings while sharing model parameters across languages based on contextual parameter generation. Moreover, our method allows for an easy but effective integration of existing linguistic typology features into the parsing model. Because not all typological features are available for every language, we further combine typological feature prediction with parsing in a multi-task model that achieves very competitive parsing performance without the need for an external prediction system for missing features.The resulting parser, UDapter, can be used for dependency parsing as well as sequence labeling tasks such as POS tagging, morphological tagging, and NER. In dependency parsing, it outperforms strong monolingual and multilingual baselines on the majority of both high-resource and low-resource (zero-shot) languages, showing the success of the proposed adaptation approach. In sequence labeling tasks, our parser surpasses the baseline on high resource languages, and performs very competitively in a zero-shot setting. Our in-depth analyses show that adapter generation via typological features of languages is key to this success.(1)
引用
收藏
页码:555 / 592
页数:38
相关论文
共 16 条
  • [1] UDapter: Language Adaptation for Truly Universal Dependency Parsing
    Ustun, Ahmet
    Bisazza, Arianna
    Bouma, Gosse
    Van Noord, Gertjan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2302 - 2315
  • [2] Viable Dependency Parsing as Sequence Labeling
    Strzyz, Michalina
    Vilares, David
    Gomez-Rodriguez, Carlos
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 717 - 723
  • [3] Typology Guided Multilingual Position Representations: Case on Dependency Parsing
    Ji, Tao
    Wu, Yuanbin
    Wang, Xiaoling
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 13524 - 13541
  • [4] Typology-based semantic labeling of numeric tabular data
    Alobaid, Ahmad
    Kacprzak, Emilia
    Corcho, Oscar
    SEMANTIC WEB, 2021, 12 (01) : 5 - 20
  • [5] Multilingual dependency-based syntactic and semantic parsing
    Che, Wanxiang
    Li, Zhenghua
    Li, Yongqiang
    Guo, Yuhang
    Qin, Bing
    Liu, Ting
    CoNLL- 2009: Shared Task - Proceedings of the Thirteenth Conference on Computational Natural Language Learning, CoNLL: Shared Task, 2009, : 49 - 54
  • [6] Semantic Role Labeling for Biomedical Corpus Based on Dependency Parsing
    Han, Lei
    Ji, Donghong
    Ren, Han
    2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE AND INTERNET TECHNOLOGY (CII 2016), 2016, : 119 - 124
  • [7] Improving Graph-Based Dependency Parsing Models With Dependency Language Models
    Zhang, Min
    Chen, Wenliang
    Duan, Xiangyu
    Zhang, Rong
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (11): : 2313 - 2323
  • [8] Multimodal Graph-Based Dependency Parsing of Natural Language
    Salama, Amr Rekaby
    Menzel, Wolfgang
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 22 - 31
  • [9] BERT-Based Sequence Labelling Approach for Dependency Parsing in Tamil
    Kumar, C. S. Ayush
    Das Maharana, Advaith
    Krishnan, Srinath Murali
    Premjith, B.
    Soman, K. P.
    PROCEEDINGS OF THE SECOND WORKSHOP ON SPEECH AND LANGUAGE TECHNOLOGIES FOR DRAVIDIAN LANGUAGES (DRAVIDIANLANGTECH 2022), 2022, : 1 - 8
  • [10] RNN-Based Sequence-Preserved Attention for Dependency Parsing
    Zhou, Yi
    Zhou, Junying
    Liu, Lu
    Feng, Jiangtao
    Peng, Haoyuan
    Zheng, Xiaoqing
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5738 - 5745