共 19 条
- [11] CHENG R, OROSZ G, MURRAY R M, Et al., End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3-6, (2019)
- [12] ELFWING S, SEYMOUR B., Parallel reward and punishment control in humans and robots: safe reinforcement learning using the MaxPain algorithm, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, pp. 140-147, (2017)
- [13] DJORDJE G, SEBASTIAN R., Safe reinforcement learning through meta-learned instincts, Proceedings of the ALIFE 2020: the 2020 Conference on Artificial Life, pp. 283-291, (2020)
- [14] SUTTON R, BARTO A., Reinforcement learning: an introduction, (2017)
- [15] SUGIYAMA M., Statistical reinforcement learning. Modern machine learning approaches, 11, 4, pp. 1330-1340, (2015)
- [16] WANG W, YU N, GAO Y, Et al., Safe off-policy deep rein-forcement learning algorithm for volt-var control in power distribution systems, IEEE Transactions on Smart Grid, 11, 4, pp. 3008-3018, (2020)
- [17] BAIRD L., Residual algorithms: reinforcement learning with function approximation, (1995)
- [18] LI Bin, PENG Shurong, PENG Junzhe, Et al., Probability density prediction of wind power based on deep learning quantile regression model, Electric Power Automation Equipment, 38, 9, pp. 15-20, (2018)
- [19] CHEN Xiliang, CAO Lei, LI Chenxi, Et al., A deep reinforcement learning method based on re-sampling optimal cache experience replay mechanism, Control and Decision, 33, 4, pp. 600-606, (2018)