Density-based Data Pruning Method for Deep Reinforcement Learning

被引:0
|
作者
Rojanaarpa, Teerapat [1 ]
Kataeva, Irina [1 ]
机构
[1] DENSO Corp, Komenoki, Nisshin 4700111, Japan
关键词
D O I
10.1109/ICMLA.2016.76
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a density-based Data Pruning method for Deep Reinforcement Learning (DRL) to improve learning stability and long-term memory in rare situations. The method controls density distribution in the experience pool by discarding high correlation data and preserving rare and unique data. We apply our method to Deep Q-networks (DQN) and Deep Deterministic Policy Gradients (DDPG) for testing in discrete and continuous action space, respectively. We evaluate our method on path following tasks in a simulated physical environment. Compared to other conventional methods such as First-In-First-Out (FIFO), our method provides a significant improvement in performance and learning stability; the average cumulative reward is increased by up to 21% and the standard deviation of the cumulative reward over multiple trials is reduced by 80%. In addition, long-term memory improvement is shown as the agent can remember and perform a behavior corresponding to a past rare event.
引用
收藏
页码:266 / 271
页数:6
相关论文
共 50 条
  • [31] A local density-based outlier detection method for high dimension data
    Abdulghafoor, Shahad Adel
    Mohamed, Lekaa Ali
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01): : 1683 - 1699
  • [32] An improved method for density-based clustering
    Jin, Hong
    Wang, Shuliang
    Zhou, Qian
    Li, Ying
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2014, 6 (04) : 347 - 368
  • [33] An ensemble density-based clustering method
    Xia, Luning
    Jing, Jiwu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
  • [34] A Dual Deep Network Based Secure Deep Reinforcement Learning Method
    Zhu F.
    Wu W.
    Fu Y.-C.
    Liu Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (08): : 1812 - 1826
  • [35] Node selection method in federated learning based on deep reinforcement learning
    He W.
    Guo S.
    Qiu X.
    Chen L.
    Zhang S.
    Tongxin Xuebao/Journal on Communications, 2021, 42 (06): : 62 - 71
  • [36] A Density-Based Re-ranking Technique for Active Learning for Data Annotations
    Zhu, Jingbo
    Wang, Huizhen
    Tsou, Benjamin K.
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 1 - +
  • [37] Research on overall energy consumption optimization method for data center based on deep reinforcement learning
    Wang Simin
    Qin Lulu
    Ma Chunmiao
    Wu Weiguo
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7333 - 7349
  • [38] Method for evaluation on energy consumption of cloud computing data center based on deep reinforcement learning
    Ma, Haizhou
    Ding, Aiping
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 208
  • [39] QoS-aware data center network reconfiguration method based on deep reinforcement learning
    Guo, Xiaotao
    Yan, Fulong
    Xue, Xuwei
    Pan, Bitao
    Exarchakos, George
    Calabretta, Nicola
    JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2021, 13 (05) : 94 - 107
  • [40] Reaching Pruning Locations in a Vine Using a Deep Reinforcement Learning Policy
    Yandun, Francisco
    Parhar, Tanvir
    Silwal, Abhisesh
    Clifford, David
    Yuan, Zhiqiang
    Levine, Gabriella
    Yaroshenko, Sergey
    Kantor, George
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 2400 - 2406