Selective eye-gaze augmentation to enhance imitation learning in Atari games

被引:5
|
作者
Thammineni, Chaitanya [1 ]
Manjunatha, Hemanth [1 ]
Esfahani, Ehsan T. [1 ]
机构
[1] Univ Buffalo, Human Loop Syst Lab, Buffalo, NY 14260 USA
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 32期
关键词
Imitation learning; Human-in-the-loop learning; Learning by demonstration; MOVEMENTS; ATTENTION; SALIENCY;
D O I
10.1007/s00521-021-06367-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the selective use of eye-gaze information in learning human actions in Atari games. Extensive evidence suggests that our eye movements convey a wealth of information about the direction of our attention and mental states and encode the information necessary to complete a task. Based on this evidence, we hypothesize that selective use of eye-gaze, as a clue for attention direction, will enhance the learning from demonstration. For this purpose, we propose a selective eye-gaze augmentation (SEA) network that learns when to use the eye-gaze information. The proposed network architecture consists of three sub-networks: gaze prediction, gating, and action prediction network. Using the prior 4 game frames, a gaze map is predicted by the gaze prediction network, which is used for augmenting the input frame. The gating network will determine whether the predicted gaze map should be used in learning and is fed to the final network to predict the action at the current frame. To validate this approach, we use publicly available Atari Human Eye-Tracking And Demonstration (Atari-HEAD) dataset consists of 20 Atari games with 28 million human demonstrations and 328 million eye-gazes (over game frames) collected from four subjects. We demonstrate the efficacy of selective eye-gaze augmentation compared to the state-of-the-art Attention Guided Imitation Learning (AGIL) and Behavior Cloning (BC). The results indicate that the selective augmentation approach (the SEA network) performs significantly better than the AGIL and BC. Moreover, to demonstrate the significance of selective use of gaze through the gating network, we compare our approach with the random selection of the gaze. Even in this case, the SEA network performs significantly better, validating the advantage of selectively using the gaze in demonstration learning.
引用
收藏
页码:23401 / 23410
页数:10
相关论文
共 20 条
  • [1] Selective eye-gaze augmentation to enhance imitation learning in Atari games
    Chaitanya Thammineni
    Hemanth Manjunatha
    Ehsan T. Esfahani
    Neural Computing and Applications, 2023, 35 : 23401 - 23410
  • [2] Eye-Gaze Controlled Wheelchair Based on Deep Learning
    Xu, Jun
    Huang, Zuning
    Liu, Liangyuan
    Li, Xinghua
    Wei, Kai
    SENSORS, 2023, 23 (13)
  • [3] Hierarchical CNN and Ensemble Learning for Efficient Eye-Gaze Detection
    Kumar, G. R. Karthik
    Sandhan, Tushar
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 529 - 541
  • [4] Using Eye Gaze to Enhance Generalization of Imitation Networks to Unseen Environments
    Liu, Congcong
    Chen, Yuying
    Liu, Ming
    Shi, Bertram E.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 2066 - 2074
  • [5] For Your Eyes Only: Controlling 3D Online Games by Eye-Gaze
    Istance, Howell
    Hyrskykari, Aulikki
    Vickers, Stephen
    Chaves, Thiago
    HUMAN-COMPUTER INTERACTION - INTERACT 2009, PT I, 2009, 5726 : 314 - +
  • [6] Word learning in monolingual and bilingual children: The influence of speaker eye-gaze
    Gangopadhyay, Ishanti
    Kaushanskaya, Margarita
    BILINGUALISM-LANGUAGE AND COGNITION, 2021, 24 (02) : 333 - 343
  • [7] Predicting Students Performance Using Eye-Gaze Features in an Embodied Learning Environment
    Chettaoui, Neila
    Atia, Ayman
    Bouhlel, Med Salim
    PROCEEDINGS OF THE 2022 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON 2022), 2022, : 704 - 711
  • [8] Incidental learning of trust from eye-gaze: Effects of race and facial trustworthiness
    Strachan, James W. A.
    Kirkham, Alexander J.
    Manssuer, Luis R.
    Over, Harriet
    Tipper, Steven P.
    VISUAL COGNITION, 2017, 25 (7-8) : 802 - 814
  • [9] SAT Reading Analysis Using Eye-Gaze Tracking Technology and Machine Learning
    Howe, Andrew
    Phong Nguyen
    INTELLIGENT TUTORING SYSTEMS, ITS 2018, 2018, 10858 : 332 - 338
  • [10] Improving the Effectiveness of E-learning Videos by leveraging Eye-gaze Data
    Saxena, Rakhi
    Narang, Sunita
    Ahuja, Harita
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (06) : 12354 - 12359