共 50 条
- [1] Model-free Policy Learning with Reward Gradients INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [2] Model-Free Trajectory Optimization for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [5] Policy Learning with Constraints in Model-free Reinforcement Learning: A Survey PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4508 - 4515
- [7] Model-Free Unsupervised Learning for Optimization Problems with Constraints PROCEEDINGS OF 2019 25TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC), 2019, : 392 - 397
- [10] Optimal Online Learning Procedures for Model-Free Policy Evaluation MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 473 - +