共 50 条
- [43] Noisy Bayesian Active Learning 2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 1626 - 1633
- [45] Learning deterministic policies in partially observable Markov decision processes INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 250 - 257
- [46] Counterexample Explanation by Learning Small Strategies in Markov Decision Processes COMPUTER AIDED VERIFICATION, PT I, 2015, 9206 : 158 - 177
- [48] Learning in Non-Cooperative Configurable Markov Decision Processes ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34