Multi-Task Learning in Natural Language Processing: An Overview

Cited by: 10
Authors
Chen, Shijie [1]
Zhang, Yu [2]
Yang, Qiang [3]
Affiliations
[1] Ohio State University, Department of Computer Science and Engineering, Columbus, OH, USA
[2] Southern University of Science and Technology, Department of Computer Science and Engineering, Shenzhen, China
[3] Hong Kong University of Science and Technology, Department of Computer Science and Engineering, Hong Kong, China
Keywords
Multi-task learning
DOI
10.1145/3663363
Chinese Library Classification
TP301 [Theory and Methods]
Discipline Code
081202
Abstract
Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffers from the overfitting and data scarcity problems that are pervasive in NLP tasks. In recent years, Multi-Task Learning (MTL), which can leverage useful information from related tasks to improve performance on all of them simultaneously, has been used to address these problems. In this article, we give an overview of the use of MTL in NLP tasks. We first review MTL architectures used in NLP tasks and categorize them into four classes: parallel, hierarchical, modular, and generative adversarial architectures. We then present optimization techniques for loss construction, gradient regularization, data sampling, and task scheduling that are needed to properly train a multi-task model. After presenting applications of MTL in a variety of NLP tasks, we introduce some benchmark datasets. Finally, we conclude and discuss several possible research directions in this field.
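To make the taxonomy above concrete, here is a minimal sketch of the simplest "parallel" architecture (hard parameter sharing): one shared encoder feeds several task-specific heads, and training minimizes a weighted sum of per-task losses, a basic form of the loss construction the abstract mentions. The sketch assumes PyTorch; the module names (SharedEncoder, MultiTaskModel), the two tasks, and all hyperparameters are illustrative assumptions, not taken from the survey.

import torch
import torch.nn as nn

class SharedEncoder(nn.Module):
    # Shared text encoder; a toy bag-of-embeddings model stands in for
    # whatever backbone (e.g., a Transformer) a real system would use.
    def __init__(self, vocab_size=1000, hidden=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)

    def forward(self, token_ids):                 # (batch, seq_len)
        return self.embed(token_ids).mean(dim=1)  # (batch, hidden)

class MultiTaskModel(nn.Module):
    # Parallel MTL: every task shares the encoder and owns its own head.
    def __init__(self, encoder, task_num_classes, hidden=64):
        super().__init__()
        self.encoder = encoder
        self.heads = nn.ModuleDict(
            {task: nn.Linear(hidden, n) for task, n in task_num_classes.items()}
        )

    def forward(self, token_ids, task):
        return self.heads[task](self.encoder(token_ids))

# Two hypothetical tasks sharing one encoder.
model = MultiTaskModel(SharedEncoder(), {"sentiment": 2, "topic": 5})
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
task_weights = {"sentiment": 1.0, "topic": 0.5}  # static loss weights

# One joint training step on dummy batches, one batch per task.
batches = {
    "sentiment": (torch.randint(0, 1000, (8, 12)), torch.randint(0, 2, (8,))),
    "topic": (torch.randint(0, 1000, (8, 12)), torch.randint(0, 5, (8,))),
}
loss = sum(
    task_weights[t] * criterion(model(x, t), y) for t, (x, y) in batches.items()
)
optimizer.zero_grad()
loss.backward()
optimizer.step()

Sharing the encoder is what lets each task regularize the others, which is how MTL mitigates the overfitting and data scarcity the abstract describes; the static task_weights dictionary stands in for the more sophisticated loss construction, data sampling, and task scheduling strategies the survey catalogs.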
Pages: 32
Related Papers
50 items in total (items [21]–[30] shown)
  • [21] Multi-Task Deep Neural Networks for Natural Language Understanding
    Liu, Xiaodong
    He, Pengcheng
    Chen, Weizhu
    Gao, Jianfeng
    57th Annual Meeting of the Association for Computational Linguistics (ACL 2019), 2019: 4487–4496
  • [22] A Sequential and Intensive Weighted Language Modeling Scheme for Multi-Task Learning-Based Natural Language Understanding
    Son, Suhyune
    Hwang, Seonjeong
    Bae, Sohyeun
    Park, Soo Jun
    Choi, Jang-Hwan
    Applied Sciences-Basel, 2021, 11(7)
  • [23] Multi-task Learning with Bidirectional Language Models for Text Classification
    Yang, Qi
    Shang, Lin
    2019 International Joint Conference on Neural Networks (IJCNN), 2019
  • [24] Multi-task Learning of Hierarchical Vision-Language Representation
    Duy-Kien Nguyen
    Okatani, Takayuki
    2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), 2019: 10484–10493
  • [25] Multi-Task Learning for Spoken Language Understanding with Shared Slots
    Li, Xiao
    Wang, Ye-Yi
    Tur, Gokhan
    12th Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), Vols 1–5, 2011: 708+
  • [26] A JOINT MULTI-TASK LEARNING FRAMEWORK FOR SPOKEN LANGUAGE UNDERSTANDING
    Li, Changliang
    Kong, Cunliang
    Zhao, Yan
    2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018: 6054–6058
  • [27] Multi-Task Learning for Parsing the Alexa Meaning Representation Language
    Perera, Vittorio
    Chung, Tagyoung
    Kollar, Thomas
    Strubell, Emma
    Thirty-Second AAAI Conference on Artificial Intelligence / Thirtieth Innovative Applications of Artificial Intelligence Conference / Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, 2018: 5390–5397
  • [28] Hierarchical and Bidirectional Joint Multi-Task Classifiers for Natural Language Understanding
    Ji, Xiaoyu
    Hu, Wanyang
    Liang, Yanyan
    Mathematics, 2023, 11(24)
  • [29] Learning to Branch for Multi-Task Learning
    Guo, Pengsheng
    Lee, Chen-Yu
    Ulbricht, Daniel
    International Conference on Machine Learning (ICML 2020), Vol. 119, 2020
  • [30] Learning to Branch for Multi-Task Learning
    Guo, Pengsheng
    Lee, Chen-Yu
    Ulbricht, Daniel
    25th Americas Conference on Information Systems (AMCIS 2019), 2019