Towards possibilistic reinforcement learning algorithms

Cited by: 0
Author
Sabbadin, R [1]
Affiliation
[1] INRA, Unite Biometrie & Intelligence Artificielle, F-31329 Castanet Tolosan, France
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We propose a framework and algorithms for reinforcement learning in sequential decision problems under uncertainty in which the rewards are qualitative and/or are temporally aggregated by a "minimum" instead of a sum, as in the classical Markov Decision Processes (MDP) framework. The framework is based on a "possibilistic" version of Markov Decision Processes, and the learning algorithms are based on indirect methods in which the possibilistic model of the problem is learned while the problem itself is solved using dynamic programming.
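To illustrate what "minimum" aggregation can look like in a dynamic programming step, here is a minimal sketch in Python. It is not taken from the paper. It assumes a finite-horizon, optimistic possibilistic criterion in which the value of a state is the minimum of its own preference degree and the best reachable value one step ahead. The names `possibilistic_backward_induction`, `PI`, and `MU`, as well as the toy numbers, are hypothetical.

```python
from typing import List, Tuple

# PI[s][a][s2]: possibility degree in [0, 1] of reaching s2 from s under action a.
# MU[s]: qualitative preference (reward) degree of state s, also in [0, 1].
# Both structures and all values below are illustrative assumptions.
PI = [
    [[0.0, 1.0, 0.3], [1.0, 0.0, 0.0]],  # state 0: action 0 moves on, action 1 stays
    [[0.0, 1.0, 0.0], [0.0, 1.0, 0.0]],  # state 1: absorbing "good" state
    [[0.0, 0.0, 1.0], [0.0, 0.0, 1.0]],  # state 2: absorbing "bad" state
]
MU = [0.8, 1.0, 0.2]


def possibilistic_backward_induction(
    pi: List[List[List[float]]], mu: List[float], horizon: int
) -> Tuple[List[float], List[List[int]]]:
    """Backward induction with min/max in place of sum/expectation.

    Assumed update (optimistic criterion, min-aggregated rewards):
        u_t(s) = min(mu(s), max_a max_s2 min(pi(s2 | s, a), u_{t-1}(s2)))
    Returns the utilities at the first stage and one greedy policy per stage.
    """
    n_states = len(mu)
    u = list(mu)  # with no steps left, a state is worth its own preference degree
    policy: List[List[int]] = []
    for _ in range(horizon):
        new_u: List[float] = []
        step_policy: List[int] = []
        for s in range(n_states):
            best_a, best_val = 0, -1.0
            for a in range(len(pi[s])):
                # Optimistic evaluation: the best plausible successor counts.
                val = max(min(pi[s][a][s2], u[s2]) for s2 in range(n_states))
                if val > best_val:
                    best_a, best_val = a, val
            # The minimum replaces the sum used in a classical MDP backup.
            new_u.append(min(mu[s], best_val))
            step_policy.append(best_a)
        u = new_u
        policy.append(step_policy)
    policy.reverse()  # policy[0] is the decision rule for the first stage
    return u, policy


utilities, policies = possibilistic_backward_induction(PI, MU, horizon=1)
```

Because the backup uses only `min` and `max`, the degrees act as an ordinal scale: no arithmetic is performed on them. In an indirect learning method of the kind the abstract mentions, `PI` and `MU` would be estimated from experience rather than given, which this sketch does not attempt.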
Pages: 404-407
Number of pages: 4
Related papers
50 in total
  • [31] EPOCH-INCREMENTAL REINFORCEMENT LEARNING ALGORITHMS
    Zajdel, Roman
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (03) : 623 - 635
  • [32] Parallelization of Reinforcement Learning Algorithms for Video Games
    Kopel, Marek
    Szczurek, Witold
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 195 - 207
  • [33] Universal Reinforcement Learning Algorithms: Survey and Experiments
    Aslanides, John
    Leike, Jan
    Hutter, Marcus
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1403 - 1410
  • [34] Application of Reinforcement Learning in Dynamic Pricing Algorithms
    Wang Jintian
    Zhou Lei
    2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 419 - 423
  • [35] Offline Evaluation of Online Reinforcement Learning Algorithms
    Mandel, Travis
    Liu, Yun-En
    Brunskill, Emma
    Popovic, Zoran
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
  • [36] Reinforcement Learning Models and Algorithms for Diabetes Management
    Yau, Kok-Lim Alvin
    Chong, Yung-Wey
    Fan, Xiumei
    Wu, Celimuge
    Saleem, Yasir
    Lim, Phei-Ching
    IEEE ACCESS, 2023, 11 : 28391 - 28415
  • [37] Reinforcement Learning Algorithms with Selector, Tuner, or Estimator
    Ala’eddin Masadeh
    Zhengdao Wang
    Ahmed E. Kamal
    Arabian Journal for Science and Engineering, 2024, 49 : 4081 - 4095
  • [38] Reinforcement learning for online control of evolutionary algorithms
    Eiben, A. E.
    Horvath, Mark
    Kowalczyk, Wojtek
    Schut, Martijn C.
    ENGINEERING SELF-ORGANISING SYSTEMS, 2007, 4335 : 151 - +
  • [39] Interactive Teaching Algorithms for Inverse Reinforcement Learning
    Kamalaruban, Parameswaran
    Devidze, Rati
    Cevher, Volkan
    Singla, Adish
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2692 - 2700
  • [40] Quantum Algorithms for Reinforcement Learning with a Generative Model
    Wang, Daochen
    Sundaram, Aarthi
    Kothari, Robin
    Kapoor, Ashish
    Roetteler, Martin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139