Learning to touch objects through stage-wise deep reinforcement learning

被引:0
|
作者
de La Bourdonnaye, Francois [1 ]
Teuliere, Celine [1 ]
Triesch, Jochen [2 ]
Chateau, Thierry [1 ]
机构
[1] Univ Clermont Auvergne, Pascal Inst, CNRS, UMR6602, Aubiere, France
[2] Frankfurt Inst Adv Studies, Frankfurt, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning complex behaviors through reinforcement learning is particularly challenging when reward is only available upon successful completion of the full behavior. In manipulation robotics, so-called shaping rewards are often used to overcome this problem. However, these usually require human engineering or (partial) world models describing, e.g., the kinematics of the robot or high-level modules for perception. Here we propose an alternative method to learn an object palm-touching task through a weakly-supervised and stage-wise learning of simpler tasks. First, the robot learns to fixate the object with its cameras. Second, the robot learns eye-hand coordination by learning to fixate its end effector. Third, using the previously acquired skills an informative shaping reward can be computed which facilitates efficient learning of the object palm-touching task. We demonstrate in simulation that learning the full task with this shaping reward is comparable to learning with an informative supervised reward.
引用
收藏
页码:7789 / 7794
页数:6
相关论文
共 50 条
  • [31] A learning method for AUV collision avoidance through deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Wu, Di
    Cui, Yunfei
    Yan, Zheping
    Du, Xue
    OCEAN ENGINEERING, 2022, 260
  • [32] Learning Kinematic Feasibility for Mobile Manipulation Through Deep Reinforcement Learning
    Honerkamp, Daniel
    Welschehold, Tim
    Valada, Abhinav
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04): : 6289 - 6296
  • [33] Learning the Aerodynamic Design of Supercritical Airfoils Through Deep Reinforcement Learning
    Li, Runze
    Zhang, Yufei
    Chen, Haixin
    AIAA JOURNAL, 2021, 59 (10) : 3988 - 4001
  • [34] Learning Ball-Balancing Robot through Deep Reinforcement Learning
    Zhou, Yifan
    Lin, Jianghao
    Wang, Shuai
    Zhang, Chong
    2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021, : 1 - 8
  • [35] Maintaining flexibility in smart grid consumption through deep learning and deep reinforcement learning☆
    Gallego, Fernando
    Martin, Cristian
    Diaz, Manuel
    Garrido, Daniel
    ENERGY AND AI, 2023, 13
  • [36] From Reinforcement Learning to Deep Reinforcement Learning: An Overview
    Agostinelli, Forest
    Hocquet, Guillaume
    Singh, Sameer
    Baldi, Pierre
    BRAVERMAN READINGS IN MACHINE LEARNING: KEY IDEAS FROM INCEPTION TO CURRENT STATE, 2018, 11100 : 298 - 328
  • [37] A hybrid learning approach for the stage-wise classification and prediction of COVID-19 X-ray images
    Adimoolam, M.
    Govindharaju, Karthi
    John, A.
    Mohan, Senthilkumar
    Ahmadian, Ali
    Ciano, Tiziana
    EXPERT SYSTEMS, 2022, 39 (04)
  • [38] Pixel-Wise Deep Reinforcement Learning Approach for Ultrasound Image Denoising
    Jarosik, Piotr
    Lewandowski, Marcin
    Klimonda, Ziemowit
    Byra, Michal
    INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS 2021), 2021,
  • [39] Transfer Learning in Deep Reinforcement Learning
    Islam, Tariqul
    Abid, Dm. Mehedi Hasan
    Rahman, Tanvir
    Zaman, Zahura
    Mia, Kausar
    Hossain, Ramim
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 145 - 153
  • [40] A stage-wise application of total quality management through the product life cycle
    Parzinger, M
    Ramarapu, N
    Timmerman, JE
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 1997, 97 (3-4) : 125 - &