共 50 条
- [4] Model-Free Imitation Learning with Policy Optimization INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [8] Adjustable Iterative Q-Learning Schemes for Model-Free Optimal Tracking Control IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 1202 - 1213
- [9] Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (06): : 3692 - 3703
- [10] Optimal Online Learning Procedures for Model-Free Policy Evaluation MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 473 - +