Consolidated actor-critic model for partially-observable Markov decision processes

被引：0

作者：

Elhanany, I. ^{[1
]}

Niedzwiedz, C. ^{[1
]}

Liu, Z.

Livingston, S. ^{[1
]}

机构：

[1] Univ Tennessee, Dept Elect Engn & Comp Sci, Knoxville, TN 37996 USA

来源：

ELECTRONICS LETTERS | 2008年 / 44卷 / 22期

关键词：

D O I：

10.1049/el:20081346

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.

引用

页码：1317 / U41

页数：2

共 50 条

[31] Decision making under uncertainty: a neural model based on partially observable Markov decision processes
Rao, Rajesh P. N.
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2010, 4
[32] PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES AND PERIODIC POLICIES WITH APPLICATIONS
Goulionis, John
Stengos, D.
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2011, 10 (06) : 1175 - 1197
[33] An Argument for the Bayesian Control of Partially Observable Markov Decision Processes
Vargo, Erik
Cogill, Randy
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (10) : 2796 - 2800
[34] Partially observable Markov decision processes for spoken dialog systems
Williams, Jason D.
Young, Steve
COMPUTER SPEECH AND LANGUAGE, 2007, 21 (02): : 393 - 422
[35] Learning deterministic policies in partially observable Markov decision processes
Miyazaki, K
Kobayashi, S
INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 250 - 257
[36] Nonmyopic multiaspect sensing with partially observable Markov decision processes
Ji, Shihao
Parr, Ronald
Carin, Lawrence
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (06) : 2720 - 2730
[37] A Fast Approximation Method for Partially Observable Markov Decision Processes
Bingbing Liu
Yu Kang
Xiaofeng Jiang
Jiahu Qin
Journal of Systems Science and Complexity, 2018, 31 : 1423 - 1436
[38] Partially Observable Markov Decision Processes incorporating epistemic uncertainties
Faddoul, R.
Raphael, W.
Soubra, A. -H.
Chateauneuf, A.
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 241 (02) : 391 - 401
[39] STRUCTURAL RESULTS FOR PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES
ALBRIGHT, SC
OPERATIONS RESEARCH, 1979, 27 (05) : 1041 - 1053
[40] MEDICAL TREATMENTS USING PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES
Goulionis, John E.
JP JOURNAL OF BIOSTATISTICS, 2009, 3 (02) : 77 - 97

← 1 2 3 4 5 →