共 50 条
- [41] Bellman Residual Orthogonalization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [42] Offline Reinforcement Learning with Behavioral Supervisor Tuning PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 4929 - 4937
- [43] Offline Reinforcement Learning for Automated Stock Trading IEEE ACCESS, 2023, 11 : 112577 - 112589
- [44] On the Role of Discount Factor in Offline Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [45] Offline Evaluation of Online Reinforcement Learning Algorithms THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
- [49] Supported Policy Optimization for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [50] Corruption-Robust Offline Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 5757 - 5773