50 entries in total
- [2] Post: Device Placement with Cross-Entropy Minimization and Proximal Policy Optimization. Advances in Neural Information Processing Systems 31 (NIPS 2018), 2018, 31.
- [3] An inexact proximal regularization method for unconstrained optimization. Mathematical Methods of Operations Research, 2017, 85: 43-59.
- [4] Proximal Policy Optimization With Policy Feedback. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2022, 52(7): 4600-4610.
- [7] Coordinated Proximal Policy Optimization. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021, 34.
- [8] Truly Proximal Policy Optimization. 35th Uncertainty in Artificial Intelligence Conference (UAI 2019), 2020, 115: 113-122.
- [10] Off-Policy Proximal Policy Optimization. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol. 37, No. 8, 2023: 9162-9170.