共 50 条
- [2] ORAD: a new framework of offline Reinforcement Learning with Q-value regularization Evolutionary Intelligence, 2024, 17 : 339 - 347
- [4] Offline Reinforcement Learning with Fisher Divergence Critic Regularization INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [5] Supported Policy Optimization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [6] Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] Offline Reinforcement Learning with Uncertainty Critic Regularization Based on Density Estimation 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
- [8] Offline Reinforcement Learning with On-Policy Q-Function Regularization MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 455 - 471
- [9] Towards Offline Reinforcement Learning with Pessimistic Value Priors EPISTEMIC UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, EPI UAI 2023, 2024, 14523 : 89 - 100
- [10] Conservative State Value Estimation for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,