Information-theoretic Task Selection for Meta-Reinforcement Learning

被引:0
|
作者
Gutierrez, Ricardo Luna [1 ]
Leonetti, Matteo [1 ]
机构
[1] Univ Leeds, Sch Comp, Leeds, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Meta-Reinforcement Learning (meta-RL) an agent is trained on a set of tasks to prepare for and learn faster in new, unseen, but related tasks. The training tasks are usually hand-crafted to be representative of the expected distribution of test tasks and hence all used in training. We show that given a set of training tasks, learning can be both faster and more effective (leading to better performance in the test tasks), if the training tasks are appropriately selected. We propose a task selection algorithm, Information-Theoretic Task Selection (ITTS), based on information theory, which optimizes the set of tasks used for training in meta-RL, irrespectively of how they are generated. The algorithm establishes which training tasks are both sufficiently relevant for the test tasks, and different enough from one another. We reproduce different meta-RL experiments from the literature and show that ITTS improves the final performance in all of them.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] An information-theoretic approach to curiosity-driven reinforcement learning
    Still, Susanne
    Precup, Doina
    THEORY IN BIOSCIENCES, 2012, 131 (03) : 139 - 148
  • [22] Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning
    Wang, Mingyang
    Bing, Zhenshan
    Yao, Xiangtong
    Wang, Shuai
    Kai, Huang
    Su, Hang
    Yang, Chenguang
    Knoll, Alois
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10157 - 10165
  • [23] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
    Yuan, Haoqi
    Lu, Zongqing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [24] Reinforcement Learning-Based Visual Navigation With Information-Theoretic Regularization
    Wu, Qiaoyun
    Xu, Kai
    Wang, Jun
    Xu, Mingliang
    Gong, Xiaoxi
    Manocha, Dinesh
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 731 - 738
  • [25] Generalization Bounds for Meta-Learning: An Information-Theoretic Analysis
    Chen, Qi
    Shui, Changjian
    Marchand, Mario
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [26] Information-Theoretic Generalization Bounds for Meta-Learning and Applications
    Jose, Sharu Theresa
    Simeone, Osvaldo
    ENTROPY, 2021, 23 (01) : 1 - 28
  • [27] Information-Theoretic Odometry Learning
    Sen Zhang
    Jing Zhang
    Dacheng Tao
    International Journal of Computer Vision, 2022, 130 : 2553 - 2570
  • [28] Information-theoretic competitive learning
    Kamimura, R
    IASTED: PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, 2003, : 359 - 365
  • [29] Information-Theoretic Odometry Learning
    Zhang, Sen
    Zhang, Jing
    Tao, Dacheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2553 - 2570
  • [30] Adaptable Image Quality Assessment Using Meta-Reinforcement Learning of Task Amenability
    Saeed, Shaheer U.
    Fu, Yunguan
    Stavrinides, Vasilis
    Baum, Zachary M. C.
    Yang, Qianye
    Rusu, Mirabela
    Fan, Richard E.
    Sonn, Geoffrey A.
    Noble, J. Alison
    Barratt, Dean C.
    Hu, Yipeng
    SIMPLIFYING MEDICAL ULTRASOUND, 2021, 12967 : 191 - 201