In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model driver's steering behavior. Initially, the vehicle dynamics is represented by a Takagi-Sugeno (T-S) fuzzy model which provides a method of approximating Q-function. Then the driver behavior model is described as an optimal control policy with decision-making model which illustrates the driving style. Subsequently, the Q-function is approximated by a quadratic polynomial-in-memberships form and the PSO algorithm is used to obtain the decision-making model from the driving data. And the corresponding optimal control policy is obtained by using the Q-learning policy iteration method. Finally, a numerical simulation is carried to show the effectiveness of the proposed method.
机构:
Qinghai Normal Univ, Sch Comp Sci & Technol, Xining 810008, Peoples R ChinaQinghai Normal Univ, Sch Comp Sci & Technol, Xining 810008, Peoples R China
Meng, Xiaoding
Li, Hecheng
论文数: 0引用数: 0
h-index: 0
机构:
Qinghai Normal Univ, Sch Math & Stat, Xining 810008, Peoples R China
Acad Plateau Sci & Sustainabil, Xining 810008, Peoples R ChinaQinghai Normal Univ, Sch Comp Sci & Technol, Xining 810008, Peoples R China
Li, Hecheng
Chen, Anshan
论文数: 0引用数: 0
h-index: 0
机构:
Qinghai Normal Univ, Sch Math & Stat, Xining 810008, Peoples R ChinaQinghai Normal Univ, Sch Comp Sci & Technol, Xining 810008, Peoples R China