共 50 条
- [22] Learning Behavior of Offline Reinforcement Learning Agents ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
- [23] Optimizing Policy via Deep Reinforcement Learning for Dialogue Management 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 582 - 589
- [25] Optimizing trajectories for highway driving with offline reinforcement learning FRONTIERS IN FUTURE TRANSPORTATION, 2023, 4
- [26] Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [27] Offline Reinforcement Learning via Policy Regularization and Ensemble Q-Functions 2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1167 - 1174
- [28] Supported Policy Optimization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [30] Weighted Policy Constraints for Offline Reinforcement Learning THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9435 - 9443