Supervised autonomy for online learning in human-robot interaction

被引:30
|
作者
Senft, Emmanuel [1 ]
Baxter, Paul [2 ]
Kennedy, James [1 ]
Lemaignan, Severin [1 ]
Belpaeme, Tony [1 ,3 ]
机构
[1] Plymouth Univ, Plymouth PL4 8AA, Devon, England
[2] Univ Lincoln, Lincoln Ctr Autonomous Syst, Lincoln LN6 7TS, England
[3] Univ Ghent, Dept Elect & Informat Syst, Imec IDLab, Ghent, Belgium
关键词
Human-Robot interaction; Reinforcement learning; Interactive machine learning; Robotics; Progressive Autonomy; Supervised autonomy;
D O I
10.1016/j.patrec.2017.03.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When a robot is learning it needs to explore its environment and how its environment responds on its actions. When the environment is large and there are a large number of possible actions the robot can take, this exploration phase can take prohibitively long. However, exploration can often be optimised by letting a human expert guide the robot during its learning. Interactive machine learning, in which a human user interactively guides the robot as it learns, has been shown to be an effective way to teach a robot. It requires an intuitive control mechanism to allow the human expert to provide feedback on the robot's progress. This paper presents a novel method which combines Reinforcement Learning and Supervised Progressively Autonomous Robot Competencies (SPARC). By allowing the user to fully control the robot and by treating rewards as implicit, SPARC aims to learn an action policy while maintaining human supervisory oversight of the robot's behaviour. This method is evaluated and compared to Interactive Reinforcement Learning in a robot teaching task. Qualitative and quantitative results indicate that SPARC allows for safer and faster learning by the robot, whilst not placing a high workload on the human teacher. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:77 / 86
页数:10
相关论文
共 50 条
  • [41] A Learning-Based Adjustable Autonomy Framework for Human-Robot Collaboration
    Rabby, Md Khurram Monir
    Karimoddini, Ali
    Khan, Mubbashar Altaf
    Jiang, Steven
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6171 - 6180
  • [42] A General Pipeline for Online Gesture Recognition in Human-Robot Interaction
    Villani, Valeria
    Secchi, Cristian
    Lippi, Marco
    Sabattini, Lorenzo
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2023, 53 (02) : 315 - 324
  • [43] Conceptual Imitation Learning in a Human-Robot Interaction Paradigm
    Hajimirsadeghi, Hossein
    Ahmadabadi, Majid Nili
    Araabi, Babak Nadjar
    Moradi, Hadi
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 3 (02)
  • [44] Physical Human-Robot Interaction Mutual Learning and Adaptation
    Ikemoto, Shuhei
    Ben Amor, Heni
    Minato, Takashi
    Jung, Bernhard
    Ishiguro, Hiroshi
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2012, 19 (04) : 24 - 35
  • [45] Learning from Human Collaborative Experience: Robot Learning via Crowdsourcing of Human-Robot Interaction
    Tan, Jeffrey Too Chuan
    Hagiwara, Yoshinobu
    Inamura, Tetsunari
    COMPANION OF THE 2017 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'17), 2017, : 297 - 298
  • [46] Impedance Variation and Learning Strategies in Human-Robot Interaction
    Sharifi, Mojtaba
    Zakerimanesh, Amir
    Mehr, Javad K.
    Torabi, Ali
    Mushahwar, Vivian K.
    Tavakoli, Mahdi
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 6462 - 6475
  • [47] Transparent Interaction Based Learning for Human-Robot Collaboration
    Bagheri, Elahe
    de Winter, Joris
    Vanderborght, Bram
    FRONTIERS IN ROBOTICS AND AI, 2022, 9
  • [48] Understanding and learning of gestures through human-robot interaction
    Kuno, Y
    Murashima, T
    Shimada, N
    Shirai, Y
    2000 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2000), VOLS 1-3, PROCEEDINGS, 2000, : 2133 - 2138
  • [49] Conceptual Imitation Learning: An Application to Human-Robot Interaction
    Hajimirsadeghi, Hossein
    Ahmadabadi, Majid Nili
    Ajallooeian, Mostafa
    Araabi, Babak Nadjar
    Moradi, Hadi
    PROCEEDINGS OF 2ND ASIAN CONFERENCE ON MACHINE LEARNING (ACML2010), 2010, 13 : 331 - 346
  • [50] The Impact of Personalisation on Human-Robot Interaction in Learning Scenarios
    Churamani, Nikhil
    Anton, Paul
    Bruegger, Marc
    Fliesswasser, Erik
    Hummel, Thomas
    Mayer, Julius
    Mustafa, Waleed
    Ng, Hwei Geok
    Thi Linh Chi Nguyen
    Quan Nguyen
    Soll, Marcus
    Springenberg, Sebastian
    Griffiths, Sascha
    Heinrich, Stefan
    Navarro-Guerrero, Nicolas
    Strahl, Erik
    Twiefel, Johannes
    Weber, Cornelius
    Wermter, Stefan
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON HUMAN AGENT INTERACTION (HAI'17), 2017, : 171 - 180