A new transfer learning framework with application to model-agnostic multi-task learning

Cited by: 7
Authors
Gupta, Sunil [1 ]
Rana, Santu [1 ]
Saha, Budhaditya [1 ]
Phung, Dinh [1 ]
Venkatesh, Svetha [1 ]
Affiliations
[1] Deakin University, Centre for Pattern Recognition and Data Analytics (PRaDA), Geelong Waurn Ponds Campus, Waurn Ponds, VIC, Australia
Keywords
Multi-task learning; Model-agnostic framework; Meta algorithm; Classification; Regression
DOI: 10.1007/s10115-016-0926-z
CLC Classification
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
Learning from a small number of examples is a challenging problem in machine learning. An effective way to improve performance is to exploit knowledge from other related tasks. Multi-task learning (MTL) is one such paradigm, aiming to improve performance by jointly modeling multiple related tasks. Although numerous classification and regression models exist in the machine learning literature, most MTL models are built around ridge or logistic regression. A limited number of works propose multi-task extensions of techniques such as support vector machines and Gaussian processes. However, all of these MTL models are tied to specific classification or regression algorithms, and no single MTL algorithm can be applied at a meta level to any given learning algorithm. Addressing this problem, we propose a generic, model-agnostic joint modeling framework that can take any classification or regression algorithm of a practitioner's choice (standard or custom-built) and build its MTL variant. The key observation driving our framework is that, owing to the small number of examples, the estimates of task parameters are usually poor, and we show that this leads, with high probability, to an under-estimation of the task relatedness between any two tasks. We derive an algorithm that brings the tasks closer to their true relatedness by improving the estimates of the task parameters, achieved through appropriate sharing of data across tasks. We provide a detailed theoretical underpinning of the algorithm. Through experiments with both synthetic and real datasets, we demonstrate that the multi-task variants of several classifiers/regressors (logistic regression, support vector machine, K-nearest neighbor, random forest, ridge regression, support vector regression) convincingly outperform their single-task counterparts. We also show that the proposed model performs comparably to or better than many state-of-the-art MTL and transfer learning baselines.
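As a minimal illustrative sketch of the kind of model-agnostic MTL meta-algorithm the abstract describes, the Python snippet below wraps an arbitrary scikit-learn-style base learner: it fits each task independently, estimates pairwise task relatedness, and then refits each task on its own data augmented with data borrowed from sufficiently related tasks. The relatedness proxy (cross-task prediction agreement) and the hard sharing threshold are assumptions made for illustration; they are not the authors' derived relatedness estimator or sharing rule.

    # Sketch of a model-agnostic multi-task wrapper (assumptions noted in comments).
    import numpy as np
    from sklearn.base import clone
    from sklearn.linear_model import LogisticRegression

    def fit_mtl(tasks, base_learner, relatedness_threshold=0.5):
        """tasks: list of (X, y) training pairs, one per classification task."""
        # Step 1: fit an independent single-task model for every task.
        models = [clone(base_learner).fit(X, y) for X, y in tasks]

        # Step 2: score pairwise task relatedness. Here: accuracy of task i's
        # model on task j's data -- a hypothetical proxy; the paper instead
        # derives relatedness from the (improved) task parameter estimates.
        T = len(tasks)
        related = np.eye(T)
        for i in range(T):
            for j in range(T):
                if i != j:
                    Xj, yj = tasks[j]
                    related[i, j] = np.mean(models[i].predict(Xj) == yj)

        # Step 3: refit each task on its own data plus data shared from
        # sufficiently related tasks, which improves parameter estimates
        # when each task's own sample is small.
        mtl_models = []
        for i, (Xi, yi) in enumerate(tasks):
            X_parts, y_parts = [Xi], [yi]
            for j, (Xj, yj) in enumerate(tasks):
                if j != i and related[i, j] >= relatedness_threshold:
                    X_parts.append(Xj)
                    y_parts.append(yj)
            mtl_models.append(clone(base_learner).fit(np.vstack(X_parts),
                                                      np.concatenate(y_parts)))
        return mtl_models

    # Example usage with one of the base learners evaluated in the paper:
    # mtl_models = fit_mtl(tasks, LogisticRegression(max_iter=1000))

Because the wrapper only calls fit and predict, any classifier or regressor exposing that interface (support vector machine, K-nearest neighbor, random forest, and so on) can be substituted as the base learner, which is the sense in which such a framework is model-agnostic.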
Pages: 933-973 (41 pages)