Teachable robots: Understanding human teaching behavior to build more effective robot learners

Cited by: 213
Authors
Thomaz, Andrea L. [1]
Breazeal, Cynthia [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
Keywords
human-robot interaction; reinforcement learning; user studies
DOI
10.1016/j.artint.2007.09.009
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
While Reinforcement Learning (RL) is not traditionally designed for interactive supervisory input from a human teacher, several works in both robot and software agents have adapted it for human input by letting a human trainer control the reward signal. In this work, we experimentally examine the assumption underlying these works, namely that the human-given reward is compatible with the traditional RL reward signal. We describe an experimental platform with a simulated RL robot and present an analysis of real-time human teaching behavior found in a study in which untrained subjects taught the robot to perform a new task. We report three main observations on how people administer feedback when teaching a Reinforcement Learning agent: (a) they use the reward channel not only for feedback, but also for future-directed guidance; (b) they have a positive bias to their feedback, possibly using the signal as a motivational channel; and (c) they change their behavior as they develop a mental model of the robotic learner. Given this, we made specific modifications to the simulated RL robot, and analyzed and evaluated its learning behavior in four follow-up experiments with human trainers. We report significant improvements on several learning measures. This work demonstrates the importance of understanding the human-teacher/robot-learner partnership in order to design algorithms that support how people want to teach and simultaneously improve the robot's learning behavior. (c) 2007 Elsevier B.V. All rights reserved.
Pages: 716-737
Page count: 22