Evolving Deep Unsupervised Convolutional Networks for Vision-Based Reinforcement Learning

被引:73
|
作者
Koutnik, Jan [1 ]
Schmidhuber, Juergen [1 ]
Gomez, Faustino [1 ]
机构
[1] USI SUPSI, IDSIA, CH-6928 Manno Lugano, Switzerland
来源
GECCO'14: PROCEEDINGS OF THE 2014 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2014年
关键词
deep learning; neuroevolution; vision-based TORCS; reinforcement learning; games; NEURAL NETS;
D O I
10.1145/2576768.2598358
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dealing with high-dimensional input spaces, like visual input, is a challenging task for reinforcement learning (RL). Neuroevolution (NE), used for continuous RL problems, has to either reduce the problem dimensionality by (1) compressing the representation of the neural network controllers or (2) employing a pre-processor (compressor) that transforms the high-dimensional raw inputs into low-dimensional features. In this paper, we are able to evolve extremely small recurrent neural network (RNN) controllers for a task that previously required networks with over a million weights. The high-dimensional visual input, which the controller would normally receive, is first transformed into a compact feature vector through a deep, max-pooling convolutional neural network (MPCNN). Both the MPCNN preprocessor and the RNN controller are evolved successfully to control a car in the TORCS racing simulator using only visual input. This is the first use of deep learning in the context evolutionary RL.
引用
收藏
页码:541 / 548
页数:8
相关论文
共 50 条
  • [21] Deep Convolutional Neural Networks with transfer learning for computer vision-based data-driven pavement distress detection
    Gopalakrishnan, Kasthurirangan
    Khaitan, Siddhartha K.
    Choudhary, Alok
    Agrawal, Ankit
    CONSTRUCTION AND BUILDING MATERIALS, 2017, 157 : 322 - 330
  • [22] Vision-Based Autonomous Navigation Approach for a Tracked Robot Using Deep Reinforcement Learning
    Ejaz, Muhammad Mudassir
    Tang, Tong Boon
    Lu, Cheng-Kai
    IEEE SENSORS JOURNAL, 2021, 21 (02) : 2230 - 2240
  • [23] Vision-based Navigation of UAV with Continuous Action Space Using Deep Reinforcement Learning
    Zhou, Benchun
    Wang, Weihong
    Liu, Zhenghua
    Wang, Jia
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5030 - 5035
  • [24] Adversarial Black-Box Attacks on Vision-based Deep Reinforcement Learning Agents
    Tanev, Atanas
    Pavlitskaya, Svetlana
    Sigloch, Joan
    Roennau, Arne
    Dillmann, Ruediger
    Zoellner, J. Marius
    2021 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SAFETY FOR ROBOTICS (ISR), 2021, : 177 - 181
  • [25] A Deep Reinforcement Learning Technique for Vision-Based Autonomous Multirotor Landing on a Moving Platform
    Rodriguez-Ramos, Alejandro
    Sampedro, Carlos
    Bavle, Hriday
    Gil Moreno, Ignacio
    Campoy, Pascual
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 1010 - 1017
  • [26] Deep Reinforcement Learning for Vision-Based Navigation of UAVs in Avoiding Stationary and Mobile Obstacles
    Kalidas, Amudhini P.
    Joshua, Christy Jackson
    Md, Abdul Quadir
    Basheer, Shakila
    Mohan, Senthilkumar
    Sakri, Sapiah
    DRONES, 2023, 7 (04)
  • [27] Vision-Based Deep Reinforcement Learning For UR5 Robot Motion Control
    Jiang, Rong
    Wang, Zhipeng
    He, Bin
    Di, Zhou
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS AND COMPUTER ENGINEERING (ICCECE), 2021, : 246 - 250
  • [28] CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-Based Autonomous Urban Driving
    Zhao, Yinuo
    Wu, Kun
    Xu, Zhiyuan
    Che, Zhengping
    Lu, Qi
    Tang, Jian
    Liu, Chi Harold
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3481 - 3489
  • [29] Intrinsically Motivated NeuroEvolution for Vision-Based Reinforcement Learning
    Cuccu, Giuseppe
    Luciw, Matthew
    Schmidhuber, Juergen
    Gomez, Faustino
    2011 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING (ICDL), 2011,
  • [30] Continual Vision-based Reinforcement Learning with Group Symmetries
    Liu, Shiqi
    Xu, Mengdi
    Huang, Peide
    Zhang, Xilun
    Liu, Yongkang
    Oguchi, Kentaro
    Zhao, Ding
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229