Density-based Data Pruning Method for Deep Reinforcement Learning

被引：0

作者：

Rojanaarpa, Teerapat ^{[1
]}

Kataeva, Irina ^{[1
]}

机构：

[1] DENSO Corp, Komenoki, Nisshin 4700111, Japan

来源：

2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016) | 2016年

关键词：

D O I：

10.1109/ICMLA.2016.76

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a density-based Data Pruning method for Deep Reinforcement Learning (DRL) to improve learning stability and long-term memory in rare situations. The method controls density distribution in the experience pool by discarding high correlation data and preserving rare and unique data. We apply our method to Deep Q-networks (DQN) and Deep Deterministic Policy Gradients (DDPG) for testing in discrete and continuous action space, respectively. We evaluate our method on path following tasks in a simulated physical environment. Compared to other conventional methods such as First-In-First-Out (FIFO), our method provides a significant improvement in performance and learning stability; the average cumulative reward is increased by up to 21% and the standard deviation of the cumulative reward over multiple trials is reduced by 80%. In addition, long-term memory improvement is shown as the agent can remember and perform a behavior corresponding to a past rare event.

引用

页码：266 / 271

页数：6

共 50 条

[31] A local density-based outlier detection method for high dimension data
Abdulghafoor, Shahad Adel
Mohamed, Lekaa Ali
INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01): : 1683 - 1699
[32] An improved method for density-based clustering
Jin, Hong
Wang, Shuliang
Zhou, Qian
Li, Ying
INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2014, 6 (04) : 347 - 368
[33] An ensemble density-based clustering method
Xia, Luning
Jing, Jiwu
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (ISKE 2007), 2007,
[34] A Dual Deep Network Based Secure Deep Reinforcement Learning Method
Zhu F.
Wu W.
Fu Y.-C.
Liu Q.
Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (08): : 1812 - 1826
[35] Node selection method in federated learning based on deep reinforcement learning
He W.
Guo S.
Qiu X.
Chen L.
Zhang S.
Tongxin Xuebao/Journal on Communications, 2021, 42 (06): : 62 - 71
[36] A Density-Based Re-ranking Technique for Active Learning for Data Annotations
Zhu, Jingbo
Wang, Huizhen
Tsou, Benjamin K.
COMPUTER PROCESSING OF ORIENTAL LANGUAGES: LANGUAGE TECHNOLOGY FOR THE KNOWLEDGE-BASED ECONOMY, 2009, 5459 : 1 - +
[37] Research on overall energy consumption optimization method for data center based on deep reinforcement learning
Wang Simin
Qin Lulu
Ma Chunmiao
Wu Weiguo
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (05) : 7333 - 7349
[38] Method for evaluation on energy consumption of cloud computing data center based on deep reinforcement learning
Ma, Haizhou
Ding, Aiping
ELECTRIC POWER SYSTEMS RESEARCH, 2022, 208
[39] QoS-aware data center network reconfiguration method based on deep reinforcement learning
Guo, Xiaotao
Yan, Fulong
Xue, Xuwei
Pan, Bitao
Exarchakos, George
Calabretta, Nicola
JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2021, 13 (05) : 94 - 107
[40] Reaching Pruning Locations in a Vine Using a Deep Reinforcement Learning Policy
Yandun, Francisco
Parhar, Tanvir
Silwal, Abhisesh
Clifford, David
Yuan, Zhiqiang
Levine, Gabriella
Yaroshenko, Sergey
Kantor, George
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 2400 - 2406

← 1 2 3 4 5 →