Consolidated actor-critic model for partially-observable Markov decision processes

Cited by: 0
Authors
Elhanany, I. [1]
Niedzwiedz, C. [1]
Liu, Z.
Livingston, S. [1]
Affiliations
[1] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA
DOI
10.1049/el:20081346
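Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract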
A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.
Pages: 1317 / U41
Page count: 2