50 items in total
- [1] Direct value-approximation for factored MDPs. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14: 1579-1586
- [2] Basis Refinement Strategies for Linear Value Function Approximation in MDPs. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [3] Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [6] Refined Regret for Adversarial MDPs with Linear Function Approximation. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [7] An Analysis of Laplacian Methods for Value Function Approximation in MDPs. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007: 2574-2579
- [8] Pseudo-MDPs and Factored Linear Action Models. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014: 189-197
- [9] Computing factored value functions for policies in structured MDPs. IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999: 1332-1339
- [10] Online Learning in MDPs with Linear Function Approximation and Bandit Feedback. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34