Feature discovery in reinforcement learning using genetic programming

被引:0
|
作者
Girgin, Sertan [1 ]
Preux, Philippe [1 ,2 ]
机构
[1] INRIA Futurs Lille, Team Project SequeL, Lille, France
[2] Univ Lille, CNRS, LIFL, UMR, F-59655 Villeneuve Dascq, France
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of reinforcement learning is to find a policy that maximizes the expected reward accumulated by an agent over time based on its interactions with the environment; to this end, a function of the state of the agent has to be learned. It is often the case that states are better characterized by a set of features. However, finding a "good" set of features is generally a tedious task which requires a good domain knowledge. In this paper, we propose a genetic programming based approach for feature discovery in reinforcement learning. A population of individuals, each representing a set of features, is evolved, and individuals are evaluated by their average performance on short reinforcement learning trials. The results of experiments conducted on several benchmark problems demonstrate that the resulting features allow the agent to learn better policies in a reduced amount of episodes.
引用
收藏
页码:218 / +
页数:2
相关论文
共 50 条
  • [31] Multi-objective Genetic Programming for Explainable Reinforcement Learning
    Videau, Mathurin
    Leite, Alessandro
    Teytaud, Olivier
    Schoenauer, Marc
    GENETIC PROGRAMMING (EUROGP 2022), 2022, : 278 - 293
  • [32] Genetic network programming with reinforcement learning and its performance evaluation
    Mabu, S
    Hirasawa, K
    Hu, J
    GENETIC AND EVOLUTIONARY COMPUTATION GECCO 2004 , PT 2, PROCEEDINGS, 2004, 3103 : 710 - 711
  • [33] Learning Strategies of Inductive Logic Programming Using Reinforcement Learning
    Isobe, Takeru
    Inoue, Katsumi
    INDUCTIVE LOGIC PROGRAMMING, ILP 2023, 2023, 14363 : 46 - 61
  • [34] Genetic Programming and Reinforcement Learning on Learning Heuristics for Dynamic Scheduling: A Preliminary Comparison
    Xu, Meng
    Mei, Yi
    Zhang, Fangfang
    Zhang, Mengjie
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2024, 19 (02) : 18 - 33
  • [35] Trading rules on stock markets using genetic network programming with reinforcement learning and importance index
    Mabu, Shingo
    Hirasawa, Kotaro
    Furuzuki, Takayuki
    IEEJ Transactions on Electronics, Information and Systems, 2007, 127 (07) : 1061 - 1067
  • [36] A Study of Double-Deck Elevator Systems Using Genetic Network Programming with Reinforcement Learning
    Zhou, Jin
    Yu, Lu
    Mabu, Shingo
    Shimada, Kaoru
    Hirasawa, Kotaro
    Markon, Sandor
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2009, 13 (01) : 35 - 44
  • [37] Multi-Behaviour Robot Control using Genetic Network Programming with Fuzzy Reinforcement Learning
    Wang, W.
    Reyes, N. H.
    Barczak, A. L. C.
    Susnjak, T.
    Sincak, Peter
    ROBOT INTELLIGENCE TECHNOLOGY ANDAPPLICATIONS 3, 2015, 345 : 151 - 158
  • [38] Feature Learning for Image Classification via Multiobjective Genetic Programming
    Shao, Ling
    Liu, Li
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (07) : 1359 - 1371
  • [39] Learning Robust Feature Descriptor for Image Registration With Genetic Programming
    Wu, Yue
    Su, Qingxiu
    Ma, Wenping
    Liu, Shaodi
    Miao, Qiguang
    IEEE ACCESS, 2020, 8 : 39389 - 39402
  • [40] Genetic programming-based feature learning for question answering
    Khodadi, Iman
    Abadeh, Mohammad Saniee
    INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (02) : 340 - 357