Continual and Multi-Task Architecture Search

Cited by: 0
Authors
Pasunuru, Ramakanth [1]
Bansal, Mohit [1]
Affiliations
[1] Univ N Carolina, Chapel Hill, NC 27515 USA
Keywords
DOI
None
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Architecture search is the process of automatically learning the neural model or cell structure that best suits the given task. Recently, this approach has shown promising performance improvements (on language modeling and image classification) with reasonable training speed, using a weight sharing strategy called Efficient Neural Architecture Search (ENAS). In our work, we first introduce a novel continual architecture search (CAS) approach, so as to continually evolve the model parameters during the sequential training of several tasks, without losing performance on previously learned tasks (via block-sparsity and orthogonality constraints), thus enabling life-long learning. Next, we explore a multi-task architecture search (MAS) approach over ENAS for finding a unified, single cell structure that performs well across multiple tasks (via joint controller rewards), and hence allows more generalizable transfer of the cell structure knowledge to an unseen new task. We empirically show the effectiveness of our sequential continual learning and parallel multi-task learning based architecture search approaches on diverse sentence-pair classification tasks (GLUE) and multimodal-generation based video captioning tasks. Further, we present several ablations and analyses on the learned cell structures.
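The abstract names three ingredients without giving their form: a block-sparsity constraint, an orthogonality constraint, and a joint controller reward. The sketch below is only one plausible reading, not the authors' implementation. It assumes block sparsity is a group-lasso style penalty over weight-matrix rows, that orthogonality is measured between the weights learned on an earlier task and the update learned on a later one, and that the joint reward is a plain average of per-task validation scores. All function names are hypothetical.

```python
import math


def block_sparsity_penalty(weights):
    """Assumed form: sum of the L2 norms of each row (an l2,1 group penalty).

    Minimizing this pushes whole rows toward zero, leaving unused
    capacity that a later task could occupy.
    """
    return sum(math.sqrt(sum(w * w for w in row)) for row in weights)


def orthogonality_penalty(old_weights, update):
    """Assumed form: squared Frobenius norm of old_weights^T @ update.

    The value is zero when every column of the update is orthogonal to
    every column of the earlier weights, so the update does not
    interfere with directions the earlier task relies on.
    """
    old_cols = list(zip(*old_weights))
    new_cols = list(zip(*update))
    return sum(
        sum(a * b for a, b in zip(u, v)) ** 2
        for u in old_cols
        for v in new_cols
    )


def joint_reward(task_rewards):
    """Assumed form: unweighted mean of per-task validation rewards."""
    return sum(task_rewards) / len(task_rewards)


if __name__ == "__main__":
    theta_1 = [[1.0], [0.0]]   # weights after the first task
    psi_2 = [[0.0], [2.0]]     # update learned on the second task
    print(block_sparsity_penalty([[3.0, 4.0], [0.0, 0.0]]))
    print(orthogonality_penalty(theta_1, psi_2))
    print(joint_reward([0.8, 0.6]))
```

In a training loop these penalties would be added to the task loss with tunable coefficients, and the averaged reward would be fed to the architecture controller; both choices are assumptions here rather than details taken from the abstract.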
Pages: 1911-1922
Page count: 12