共 50 条
- [22] Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2002, : 758 - 763
- [25] An optimal control method for hybrid systems based on Q-learning for an intersection traffic signal control Gaojishu Tongxin/Chinese High Technology Letters, 2007, 17 (05): : 498 - 502