共 50 条
- [32] Constraints Penalized Q-learning for Safe Offline Reinforcement Learning THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8753 - 8760
- [33] Reinforcement distribution in continuous state action space fuzzy Q-learning: A novel approach FUZZY LOGIC AND APPLICATIONS, 2006, 3849 : 40 - 45
- [35] Q-learning agents in a Cournot oligopoly model JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2008, 32 (10): : 3275 - 3293
- [36] Q-Learning Transformation for Training on JADE Agents REVISTA DIGITAL LAMPSAKOS, 2015, (14): : 25 - 32
- [37] Multiagent Q-learning with Sub-Team Coordination ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [38] Q-learning as a model of utilitarianism in a human–machine team Neural Computing and Applications, 2023, 35 : 16853 - 16864
- [39] The Sample Complexity of Teaching-by-Reinforcement on Q-Learning THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10939 - 10947