50 items in total
- [41] Off-Policy Differentiable Logic Reinforcement Learning MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 617 - 632
- [42] Marginalized Operators for Off-policy Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 655 - 679
- [43] Off-Policy Shaping Ensembles in Reinforcement Learning 21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 1021 - 1022
- [44] Learning Routines for Effective Off-Policy Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [45] Enhanced Strategies for Off-Policy Reinforcement Learning Algorithms in HVAC Control 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024: 1691 - 1696
- [46] Off-policy Reinforcement Learning for Robust Control of Discrete-time Uncertain Linear Systems PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017: 2507 - 2512
- [47] Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [50] Off-Policy Risk-Sensitive Reinforcement Learning-Based Constrained Robust Optimal Control IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (04): 2478 - 2491