Evolving Deep Unsupervised Convolutional Networks for Vision-Based Reinforcement Learning

被引:73
|
作者
Koutnik, Jan [1 ]
Schmidhuber, Juergen [1 ]
Gomez, Faustino [1 ]
机构
[1] USI SUPSI, IDSIA, CH-6928 Manno Lugano, Switzerland
关键词
deep learning; neuroevolution; vision-based TORCS; reinforcement learning; games; NEURAL NETS;
D O I
10.1145/2576768.2598358
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dealing with high-dimensional input spaces, like visual input, is a challenging task for reinforcement learning (RL). Neuroevolution (NE), used for continuous RL problems, has to either reduce the problem dimensionality by (1) compressing the representation of the neural network controllers or (2) employing a pre-processor (compressor) that transforms the high-dimensional raw inputs into low-dimensional features. In this paper, we are able to evolve extremely small recurrent neural network (RNN) controllers for a task that previously required networks with over a million weights. The high-dimensional visual input, which the controller would normally receive, is first transformed into a compact feature vector through a deep, max-pooling convolutional neural network (MPCNN). Both the MPCNN preprocessor and the RNN controller are evolved successfully to control a car in the TORCS racing simulator using only visual input. This is the first use of deep learning in the context evolutionary RL.
引用
收藏
页码:541 / 548
页数:8
相关论文
共 50 条
  • [41] Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning
    Lee, Keuntaek
    Vlahov, Bogdan
    Gibson, Jason
    Rehg, James M.
    Theodorou, Evangelos A.
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 10793 - 10799
  • [42] Vision-Based Vehicle Behavior Analysis: A Structured Learning Approach via Convolutional Neural Networks
    Mou, Luntian
    Xie, Haitao
    Chen, Yanyan
    CICTP 2019: TRANSPORTATION IN CHINA-CONNECTING THE WORLD, 2019, : 5709 - 5720
  • [43] Vision-based vehicle behaviour analysis: a structured learning approach via convolutional neural networks
    Mou, Luntian
    Xie, Haitao
    Mao, Shasha
    Zhao, Pengfei
    Chen, Yanyan
    IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (07) : 792 - 801
  • [44] Robot Manipulation of Dynamic Object with Vision-based Reinforcement Learning
    Liu, Chenchen
    Zhang, Zhengshen
    Zhou, Lei
    Liu, Zhiyang
    Ang, Marcelo H., Jr.
    Lu, Wenfeng
    Tay, Francis E. H.
    2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024, 2024, : 21 - 26
  • [45] A Bayesian Approach to Reinforcement Learning of Vision-Based Vehicular Control
    Gharaee, Zahra
    Holmquist, Karl
    He, Linbo
    Felsberg, Michael
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3947 - 3954
  • [46] Vision-Based Reinforcement Learning using Approximate Policy Iteration
    Shaker, Marwan R.
    Yue, Shigang
    Duckett, Tom
    ICAR: 2009 14TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS, VOLS 1 AND 2, 2009, : 594 - 599
  • [47] Vision-Based Autonomous Driving: A Hierarchical Reinforcement Learning Approach
    Wang, Jiao
    Sun, Haoyi
    Zhu, Can
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11213 - 11226
  • [48] Vision-based reinforcement learning control of soft robot manipulators
    Li, Jinzhou
    Ma, Jie
    Hu, Yujie
    Zhang, Li
    Liu, Zhijie
    Sun, Shiying
    ROBOTIC INTELLIGENCE AND AUTOMATION, 2024, 44 (06): : 783 - 790
  • [49] Machine Vision-Based Prediction of Lettuce Phytomorphological Descriptors Using Deep Learning Networks
    Lauguico, Sandy
    Concepcion, Ronnie, II
    Tobias, Rogelio Ruzcko
    Alejandrino, Jonnel
    De Guia, Justin
    Guillermo, Marielet
    Sybingco, Edwin
    Dadios, Elmer
    2020 IEEE 12TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT, AND MANAGEMENT (HNICEM), 2020,
  • [50] Vision-based Obstacle Avoidance Using Deep Learning
    Gaya, Joel O.
    Goncalves, Lucas T.
    Duarte, Amanda C.
    Zanchetta, Breno
    Drews-, Paulo, Jr.
    Botelho, Silvia S. C.
    PROCEEDINGS OF 13TH LATIN AMERICAN ROBOTICS SYMPOSIUM AND 4TH BRAZILIAN SYMPOSIUM ON ROBOTICS - LARS/SBR 2016, 2016, : 7 - 12