共 50 条
- [1] Proximal Policy Optimization with Relative Pearson Divergence 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 8416 - 8421
- [3] Proximal Policy Optimization With Policy Feedback IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4600 - 4610
- [4] Improving Proximal Policy Optimization Algorithm in Interactive Multi-agent Systems 2024 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL 2024, 2024,
- [5] Coordinated Proximal Policy Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [6] Truly Proximal Policy Optimization 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 113 - 122
- [8] Off-Policy Proximal Policy Optimization THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9162 - 9170
- [9] Divergence-Augmented Policy Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32