Learning to touch objects through stage-wise deep reinforcement learning

被引:0
|
作者
de La Bourdonnaye, Francois [1 ]
Teuliere, Celine [1 ]
Triesch, Jochen [2 ]
Chateau, Thierry [1 ]
机构
[1] Univ Clermont Auvergne, Pascal Inst, CNRS, UMR6602, Aubiere, France
[2] Frankfurt Inst Adv Studies, Frankfurt, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning complex behaviors through reinforcement learning is particularly challenging when reward is only available upon successful completion of the full behavior. In manipulation robotics, so-called shaping rewards are often used to overcome this problem. However, these usually require human engineering or (partial) world models describing, e.g., the kinematics of the robot or high-level modules for perception. Here we propose an alternative method to learn an object palm-touching task through a weakly-supervised and stage-wise learning of simpler tasks. First, the robot learns to fixate the object with its cameras. Second, the robot learns eye-hand coordination by learning to fixate its end effector. Third, using the previously acquired skills an informative shaping reward can be computed which facilitates efficient learning of the object palm-touching task. We demonstrate in simulation that learning the full task with this shaping reward is comparable to learning with an informative supervised reward.
引用
收藏
页码:7789 / 7794
页数:6
相关论文
共 50 条
  • [21] Learning heuristics for weighted CSPs through deep reinforcement learning
    Dingding Chen
    Ziyu Chen
    Zhongshi He
    Junsong Gao
    Zhizhuo Su
    Applied Intelligence, 2023, 53 : 8844 - 8863
  • [22] Learning heuristics for weighted CSPs through deep reinforcement learning
    Chen, Dingding
    Chen, Ziyu
    He, Zhongshi
    Gao, Junsong
    Su, Zhizhuo
    APPLIED INTELLIGENCE, 2023, 53 (08) : 8844 - 8863
  • [23] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
  • [24] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [25] Deep Learning-Based Stage-Wise Risk Stratification for Early Lung Adenocarcinoma in CT Images: A Multi-Center Study
    Gong, Jing
    Liu, Jiyu
    Li, Haiming
    Zhu, Hui
    Wang, Tingting
    Hu, Tingdan
    Li, Menglei
    Xia, Xianwu
    Hu, Xianfang
    Peng, Weijun
    Wang, Shengping
    Tong, Tong
    Gu, Yajia
    CANCERS, 2021, 13 (13)
  • [26] Learn to Steer through Deep Reinforcement Learning
    Wu, Keyu
    Esfahani, Mahdi Abolfazli
    Yuan, Shenghai
    Wang, Han
    SENSORS, 2018, 18 (11)
  • [27] Autonomous exploration through deep reinforcement learning
    Yan, Xiangda
    Huang, Jie
    He, Keyan
    Hong, Huajie
    Xu, Dasheng
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2023, 50 (05): : 793 - 803
  • [28] The Advance of Reinforcement Learning and Deep Reinforcement Learning
    Lyu, Le
    Shen, Yang
    Zhang, Sicheng
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 644 - 648
  • [29] From Design to Deployment of Zero Touch Deep Reinforcement Learning WLANs
    Iacoboaiea, Ovidiu
    Krolikowski, Jonatan
    Houidi, Zied Ben
    Rossi, Dario
    IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (02) : 104 - 109
  • [30] Stage-Wise Categorization and Prediction of Diabetic Retinopathy Using Ensemble Learning and 2D-CNN
    Balamurugan, N. M.
    Maithili, K.
    Babu, T. K. S. Rathish
    Adimoolam, M.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01): : 499 - 514