共 50 条
- [32] Dynamic Grouping within Minimax Optimal Strategy for Stochastic Multi-ArmedBandits in Reinforcement Learning Recommendation APPLIED SCIENCES-BASEL, 2024, 14 (08):
- [36] A reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems OPTIMAL CONTROL APPLICATIONS & METHODS, 2010, 31 (04): : 365 - 374
- [37] Reinforcement Learning based on Stochastic Dynamic Programming for Condition-based Maintenance of Deteriorating Production Processes 2022 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2022, : 17 - 24
- [38] Reinforcement Learning-Based Dynamic Order Recommendation for On-Demand Food Delivery TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 29 (02): : 356 - 367