Learning to touch objects through stage-wise deep reinforcement learning

被引:0
|
作者
de La Bourdonnaye, Francois [1 ]
Teuliere, Celine [1 ]
Triesch, Jochen [2 ]
Chateau, Thierry [1 ]
机构
[1] Univ Clermont Auvergne, Pascal Inst, CNRS, UMR6602, Aubiere, France
[2] Frankfurt Inst Adv Studies, Frankfurt, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning complex behaviors through reinforcement learning is particularly challenging when reward is only available upon successful completion of the full behavior. In manipulation robotics, so-called shaping rewards are often used to overcome this problem. However, these usually require human engineering or (partial) world models describing, e.g., the kinematics of the robot or high-level modules for perception. Here we propose an alternative method to learn an object palm-touching task through a weakly-supervised and stage-wise learning of simpler tasks. First, the robot learns to fixate the object with its cameras. Second, the robot learns eye-hand coordination by learning to fixate its end effector. Third, using the previously acquired skills an informative shaping reward can be computed which facilitates efficient learning of the object palm-touching task. We demonstrate in simulation that learning the full task with this shaping reward is comparable to learning with an informative supervised reward.
引用
收藏
页码:7789 / 7794
页数:6
相关论文
共 50 条
  • [1] Semantic Stage-Wise Learning for Knowledge Distillation
    Liu, Dongqin
    Li, Wentao
    Zhou, Wei
    Li, Zhaoxing
    Dai, Jiao
    Han, Jizhong
    Li, Ruixuan
    Hu, Songlin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 816 - 821
  • [2] Stage-Wise Stochastic Deep Learning Inversion Framework for Subsurface Sedimentary Structure Identification
    Zhan, Chuanjun
    Dai, Zhenxue
    Soltanian, Mohamad Reza
    Zhang, Xiaoying
    GEOPHYSICAL RESEARCH LETTERS, 2022, 49 (01)
  • [3] Progressive Stage-wise Learning for Unsupervised Feature Representation Enhancement
    Li, Zefan
    Liu, Chenxi
    Yuille, Alan
    Ni, Bingbing
    Zhang, Wenjun
    Gao, Wen
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9762 - 9771
  • [4] Stage-Wise Learning of Reaching Using Little Prior Knowledge
    de la Bourdonnaye, Francois
    Teuliere, Celine
    Triesch, Jochen
    Chateau, Thierry
    FRONTIERS IN ROBOTICS AND AI, 2018, 5
  • [5] Real-time stage-wise object tracking in traffic scenes: an online tracker selection method via deep reinforcement learning
    Lu, Xiao
    Cao, Yihong
    Liu, Sheng
    Zhou, Xuanyu
    Yang, Yimin
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (24): : 16831 - 16846
  • [6] Real-time stage-wise object tracking in traffic scenes: an online tracker selection method via deep reinforcement learning
    Xiao Lu
    Yihong Cao
    Sheng Liu
    Xuanyu Zhou
    Yimin Yang
    Neural Computing and Applications, 2021, 33 : 16831 - 16846
  • [7] ATTENTION-GUIDED DERAINING NETWORK VIA STAGE-WISE LEARNING
    Jiang, Kui
    Wang, Zhongyuan
    Yi, Peng
    Chen, Chen
    Yang, Yuhong
    Tian, Xin
    Jiang, Junjun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2618 - 2622
  • [8] Learning Stage-Wise GANs for Whistle Extraction in Time-Frequency Spectrograms
    Li, Pu
    Roch, Marie A.
    Klinck, Holger
    Fleishman, Erica
    Gillespie, Douglas
    Nosal, Eva-Marie
    Shiu, Yu
    Liu, Xiaobai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 9302 - 9314
  • [9] Effective feature learning and fusion of multimodality data using stage-wise deep neural network for dementia diagnosis
    Zhou, Tao
    Thung, Kim-Han
    Zhu, Xiaofeng
    Shen, Dinggang
    HUMAN BRAIN MAPPING, 2019, 40 (03) : 1001 - 1016
  • [10] Deep Reinforcement Learning for Page-wise Recommendations
    Zhao, Xiangyu
    Xia, Long
    Zhang, Liang
    Ding, Zhuoye
    Yin, Dawei
    Tang, Jiliang
    12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 95 - 103