Maximizing learning progress: An internal reward system for development

被引:0
|
作者
Kaplan, F [1 ]
Oudeyer, PY [1 ]
机构
[1] Sony Comp Sci Lab Paris, Dev Robot Grp, F-75005 Paris, France
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This chapter presents a generic internal reward system that drives an agent to increase the complexity of its behavior. This reward system does not reinforce a predefined task. Its purpose is to drive the agent to progress in learning given its embodiment and the environment in which it is placed. The dynamics created by such a system are studied first in a simple environment and then in the context of active vision.
引用
收藏
页码:259 / 270
页数:12
相关论文
共 50 条
  • [1] Work in Progress - Maximizing Student Engagement in a Learning Management System
    Little-Wiles, Julie M.
    Hundley, Stephen P.
    Koehler, Adrie
    2010 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2010,
  • [2] Maximizing the average reward in episodic reinforcement learning tasks
    Reinke, Chris
    Uchibe, Eiji
    Doya, Kenji
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2015, : 420 - 421
  • [3] Evolution of an Internal Reward Function for Reinforcement Learning
    Zuo, Weiyi
    Pedersen, Joachim Winther
    Risi, Sebastian
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 351 - 354
  • [4] ALTRUISM AND THE INTERNAL REWARD SYSTEM OR THE OPIUM OF THE PEOPLE
    DANIELLI, JF
    JOURNAL OF SOCIAL AND BIOLOGICAL STRUCTURES, 1980, 3 (02) : 87 - 94
  • [5] Effective integration of imitation learning and reinforcement learning by generating internal reward
    Hamahata, Keita
    Taniguchi, Tadahiro
    Sakakibara, Kazutoshi
    Nishikawa, Ikuko
    Tabuchi, Kazuma
    Sawaragi, Tetsuo
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 121 - +
  • [6] Adolescent development of the reward system
    Galvan, Adriana
    FRONTIERS IN HUMAN NEUROSCIENCE, 2010, 4
  • [7] Natural reward learning and limbic system
    Phillips, AG
    Ahn, S
    BEHAVIOURAL PHARMACOLOGY, 2004, 15 (5-6): : A7 - A7
  • [8] Maximizing Service Reward for Queues With Deadlines
    Raviv, Li-On
    Leshem, Amir
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (05) : 2296 - 2308
  • [9] Learning classifier system with average reward reinforcement learning
    Zang, Zhaoxiang
    Li, Dehua
    Wang, Junying
    Xia, Dan
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 58 - 71
  • [10] Tolerating faults while maximizing reward
    Aydin, H
    Melhem, R
    Mossé, D
    EUROMICRO RTS 2000: 12TH EUROMICRO CONFERENCE ON REAL-TIME SYSTEMS, PROCEEDINGS, 2000, : 219 - 226