Learning to touch objects through stage-wise deep reinforcement learning

被引:0
|
作者
de La Bourdonnaye, Francois [1 ]
Teuliere, Celine [1 ]
Triesch, Jochen [2 ]
Chateau, Thierry [1 ]
机构
[1] Univ Clermont Auvergne, Pascal Inst, CNRS, UMR6602, Aubiere, France
[2] Frankfurt Inst Adv Studies, Frankfurt, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning complex behaviors through reinforcement learning is particularly challenging when reward is only available upon successful completion of the full behavior. In manipulation robotics, so-called shaping rewards are often used to overcome this problem. However, these usually require human engineering or (partial) world models describing, e.g., the kinematics of the robot or high-level modules for perception. Here we propose an alternative method to learn an object palm-touching task through a weakly-supervised and stage-wise learning of simpler tasks. First, the robot learns to fixate the object with its cameras. Second, the robot learns eye-hand coordination by learning to fixate its end effector. Third, using the previously acquired skills an informative shaping reward can be computed which facilitates efficient learning of the object palm-touching task. We demonstrate in simulation that learning the full task with this shaping reward is comparable to learning with an informative supervised reward.
引用
收藏
页码:7789 / 7794
页数:6
相关论文
共 50 条
  • [41] Learning to Drive with Deep Reinforcement Learning
    Chukamphaeng, Nut
    Pasupa, Kitsuchart
    Antenreiter, Martin
    Auer, Peter
    2021 13TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST-2021), 2021, : 147 - 152
  • [42] Passing Through Narrow Gaps with Deep Reinforcement Learning
    Tidd, Brendan
    Cosgun, Akansel
    Leitner, Jurgen
    Hudson, Nicolas
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3492 - 3498
  • [43] IFRIT: Focused Testing through Deep Reinforcement Learning
    Romdhana, Andrea
    Ceccato, Mariano
    Merlo, Alessio
    Tonella, Paolo
    2022 IEEE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST 2022), 2022, : 24 - 34
  • [44] Push Recovery Strategies through Deep Reinforcement Learning
    Melo, Dicksiano Carvalho
    Maximo, Marcos R. O. A.
    da Cunha, Adilson Marques
    2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 240 - 245
  • [45] Precise atom manipulation through deep reinforcement learning
    I-Ju Chen
    Markus Aapro
    Abraham Kipnis
    Alexander Ilin
    Peter Liljeroth
    Adam S. Foster
    Nature Communications, 13
  • [46] Dexterous in-hand manipulation of slender cylindrical objects through deep reinforcement learning with tactile sensing
    Hu, Wenbin
    Huang, Bidan
    Lee, Wang Wei
    Yang, Sicheng
    Zheng, Yu
    Li, Zhibin
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2025, 186
  • [47] Generation of ice states through deep reinforcement learning
    Zhao, Kai-Wen
    Kao, Wen-Han
    Wu, Kai-Hsin
    Kao, Ying-Jer
    PHYSICAL REVIEW E, 2019, 99 (06)
  • [48] Universal quantum control through deep reinforcement learning
    Niu, Murphy Yuezhen
    Boixo, Sergio
    Smelyanskiy, Vadim N.
    Neven, Hartmut
    NPJ QUANTUM INFORMATION, 2019, 5 (1)
  • [49] Supply Chain Synchronization Through Deep Reinforcement Learning
    Jackson, Ilya
    TRANSBALTICA XII: TRANSPORTATION SCIENCE AND TECHNOLOGY, 2022, : 490 - 498
  • [50] Market Making With Signals Through Deep Reinforcement Learning
    Gasperov, Bruno
    Kostanjcar, Zvonko
    IEEE ACCESS, 2021, 9 : 61611 - 61622