共 50 条
- [41] Learning Behavior of Offline Reinforcement Learning Agents ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
- [43] Error bounds in reinforcement learning policy evaluation ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3501 : 438 - 449
- [44] Multigrid methods for policy evaluation and reinforcement learning 2005 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL & 13TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1 AND 2, 2005, : 1391 - 1396
- [45] Least Square Policy Evaluation in Reinforcement Learning INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND INDUSTRIAL AUTOMATION (ICITIA 2015), 2015, : 583 - 590
- [47] Conservative Offline Distributional Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [48] On Efficient Sampling in Offline Reinforcement Learning 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1 - 6
- [49] Offline Reinforcement Learning with Differential Privacy ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [50] Bootstrapped Transformer for Offline Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,