Multi-Task Learning in Natural Language Processing: An Overview

被引：10

作者：

Chen, Shijie ^{[1
]}

Zhang, Yu ^{[2
]}

Yang, Qiang ^{[3
]}

机构：

[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH USA

[2] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen, Peoples R China

[3] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China

来源：

ACM COMPUTING SURVEYS | 2024年 / 56卷 / 12期

关键词：

Multi-task learning;

D O I：

10.1145/3663363

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffer from overfitting and data scarcity problems that are pervasive in NLP tasks. In recent years, Multi-Task Learning (MTL), which can leverage useful information of related tasks to achieve simultaneous performance improvement on these tasks, has been used to handle these problems. In this article, we give an overview of the use of MTL in NLP tasks. We first review MTL architectures used in NLP tasks and categorize them into four classes, including parallel architecture, hierarchical architecture, modular architecture, and generative adversarial architecture. Then we present optimization techniques on loss construction, gradient regularization, data sampling, and task scheduling to properly train a multi-task model. After presenting applications of MTL in a variety of NLP tasks, we introduce some benchmark datasets. Finally, we make a conclusion and discuss several possible research directions in this field.

引用

页数：32

共 50 条

[31] Boosted multi-task learning
Olivier Chapelle
Pannagadatta Shivaswamy
Srinivas Vadrevu
Kilian Weinberger
Ya Zhang
Belle Tseng
Machine Learning, 2011, 85 : 149 - 173
[32] On Partial Multi-Task Learning
He, Yi
Wu, Baijun
Wu, Di
Wu, Xindong
ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1174 - 1181
[33] Pareto Multi-Task Learning
Lin, Xi
Zhen, Hui-Ling
Li, Zhenhua
Zhang, Qingfu
Kwong, Sam
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[34] Federated Multi-Task Learning
Smith, Virginia
Chiang, Chao-Kai
Sanjabi, Maziar
Talwalkar, Ameet
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[35] Asynchronous Multi-Task Learning
Baytas, Inci M.
Yan, Ming
Jain, Anil K.
Zhou, Jiayu
2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 11 - 20
[36] Calibrated Multi-Task Learning
Nie, Feiping
Hu, Zhanxuan
Li, Xuelong
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2012 - 2021
[37] Boosted multi-task learning
Chapelle, Olivier
Shivaswamy, Pannagadatta
Vadrevu, Srinivas
Weinberger, Kilian
Zhang, Ya
Tseng, Belle
MACHINE LEARNING, 2011, 85 (1-2) : 149 - 173
[38] Distributed Multi-Task Learning
Wang, Jialei
Kolar, Mladen
Srebro, Nathan
ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 751 - 760
[39] Parallel Multi-Task Learning
Zhang, Yu
2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 629 - 638
[40] Learning Sparse Task Relations in Multi-Task Learning
Zhang, Yu
Yang, Qiang
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2914 - 2920

← 1 2 3 4 5 →