首页
学术期刊
论文检测
AIGC检测
热点
更多
数据
AVERAGE, SENSITIVE AND BLACKWELL OPTIMAL POLICIES IN DENUMERABLE MARKOV DECISION CHAINS WITH UNBOUNDED REWARDS
被引:34
|
作者
:
DEKKER, R
论文数:
0
引用数:
0
h-index:
0
机构:
STATE UNIV LEIDEN,INST APPL MATH & COMP SCI,2312 AV LEIDEN,NETHERLANDS
STATE UNIV LEIDEN,INST APPL MATH & COMP SCI,2312 AV LEIDEN,NETHERLANDS
DEKKER, R
[
1
]
HORDIJK, A
论文数:
0
引用数:
0
h-index:
0
机构:
STATE UNIV LEIDEN,INST APPL MATH & COMP SCI,2312 AV LEIDEN,NETHERLANDS
STATE UNIV LEIDEN,INST APPL MATH & COMP SCI,2312 AV LEIDEN,NETHERLANDS
HORDIJK, A
[
1
]
机构
:
[1]
STATE UNIV LEIDEN,INST APPL MATH & COMP SCI,2312 AV LEIDEN,NETHERLANDS
来源
:
MATHEMATICS OF OPERATIONS RESEARCH
|
1988年
/ 13卷
/ 03期
关键词
:
D O I
:
10.1287/moor.13.3.395
中图分类号
:
C93 [管理学];
O22 [运筹学];
学科分类号
:
070105 ;
12 ;
1201 ;
1202 ;
120202 ;
摘要
:
引用
收藏
页码:395 / 420
页数:26
相关论文
共 50 条
[1]
The computation of average optimal policies in denumerable state Markov decision chains
Sennott, LI
论文数:
0
引用数:
0
h-index:
0
Sennott, LI
ADVANCES IN APPLIED PROBABILITY,
1997,
29
(01)
: 114
-
137
[2]
CONDITIONS FOR EXISTENCE OF AVERAGE AND BLACKWELL OPTIMAL STATIONARY POLICIES IN DENUMERABLE MARKOV DECISION-PROCESSES
LASSERRE, JB
论文数:
0
引用数:
0
h-index:
0
LASSERRE, JB
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS,
1988,
136
(02)
: 479
-
489
[3]
RECURRENCE CONDITIONS FOR AVERAGE AND BLACKWELL OPTIMALITY IN DENUMERABLE STATE MARKOV DECISION CHAINS
DEKKER, R
论文数:
0
引用数:
0
h-index:
0
机构:
LEIDEN UNIV,INST APPL MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS
LEIDEN UNIV,INST APPL MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS
DEKKER, R
HORDIJK, A
论文数:
0
引用数:
0
h-index:
0
机构:
LEIDEN UNIV,INST APPL MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS
LEIDEN UNIV,INST APPL MATH & COMP SCI,2300 RA LEIDEN,NETHERLANDS
HORDIJK, A
MATHEMATICS OF OPERATIONS RESEARCH,
1992,
17
(02)
: 271
-
289
[4]
Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards
Hordijk, A
论文数:
0
引用数:
0
h-index:
0
机构:
Leiden Univ, Dept Math & Comp Sci, NL-2300 RA Leiden, Netherlands
Leiden Univ, Dept Math & Comp Sci, NL-2300 RA Leiden, Netherlands
Hordijk, A
Yushkevich, AA
论文数:
0
引用数:
0
h-index:
0
机构:
Leiden Univ, Dept Math & Comp Sci, NL-2300 RA Leiden, Netherlands
Yushkevich, AA
MATHEMATICAL METHODS OF OPERATIONS RESEARCH,
1999,
49
(01)
: 1
-
39
[5]
Characterization of the Optimal Risk-Sensitive Average Cost in Denumerable Markov Decision Chains
Cavazos-Cadena, Rolando
论文数:
0
引用数:
0
h-index:
0
机构:
Univ Autonoma Agr Antonio Narro, Dept Estadist & Calculo, Saltillo 25315, Coahuila, Mexico
Univ Autonoma Agr Antonio Narro, Dept Estadist & Calculo, Saltillo 25315, Coahuila, Mexico
Cavazos-Cadena, Rolando
MATHEMATICS OF OPERATIONS RESEARCH,
2018,
43
(03)
: 1025
-
1050
[6]
Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards
Arie Hordijk
论文数:
0
引用数:
0
h-index:
0
机构:
Department of Mathematics and Computer Science,
Arie Hordijk
Alexander A. Yushkevich
论文数:
0
引用数:
0
h-index:
0
机构:
Department of Mathematics and Computer Science,
Alexander A. Yushkevich
Mathematical Methods of Operations Research,
1999,
49
(1)
: 1
-
39
[7]
Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards
Arie Hordijk
论文数:
0
引用数:
0
h-index:
0
机构:
Department of Mathematics and Computer Science,
Arie Hordijk
Alexander A. Yushkevich
论文数:
0
引用数:
0
h-index:
0
机构:
Department of Mathematics and Computer Science,
Alexander A. Yushkevich
Mathematical Methods of Operations Research,
1999,
50
: 421
-
448
[8]
Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards
Hordijk, A
论文数:
0
引用数:
0
h-index:
0
机构:
Leiden Univ, Dept Math & Comp Sci, NL-2300 RA Leiden, Netherlands
Leiden Univ, Dept Math & Comp Sci, NL-2300 RA Leiden, Netherlands
Hordijk, A
Yushkevich, AA
论文数:
0
引用数:
0
h-index:
0
机构:
Leiden Univ, Dept Math & Comp Sci, NL-2300 RA Leiden, Netherlands
Yushkevich, AA
MATHEMATICAL METHODS OF OPERATIONS RESEARCH,
1999,
50
(03)
: 421
-
448
[9]
WEAK CONDITIONS FOR THE EXISTENCE OF OPTIMAL STATIONARY POLICIES IN AVERAGE MARKOV DECISION CHAINS WITH UNBOUNDED COSTS
CAVAZOSCADENA, R
论文数:
0
引用数:
0
h-index:
0
机构:
TEXAS TECH UNIV,DEPT MATH,LUBBOCK,TX 79409
TEXAS TECH UNIV,DEPT MATH,LUBBOCK,TX 79409
CAVAZOSCADENA, R
KYBERNETIKA,
1989,
25
(03)
: 145
-
156
[10]
DENUMERABLE UNDISCOUNTED SEMI-MARKOV DECISION-PROCESSES WITH UNBOUNDED REWARDS
FEDERGRUEN, A
论文数:
0
引用数:
0
h-index:
0
机构:
UNIV ROCHESTER,GRAD SCH MANAGEMENT,ROCHESTER,NY 14627
FEDERGRUEN, A
SCHWEITZER, PJ
论文数:
0
引用数:
0
h-index:
0
机构:
UNIV ROCHESTER,GRAD SCH MANAGEMENT,ROCHESTER,NY 14627
SCHWEITZER, PJ
TIJMS, HC
论文数:
0
引用数:
0
h-index:
0
机构:
UNIV ROCHESTER,GRAD SCH MANAGEMENT,ROCHESTER,NY 14627
TIJMS, HC
MATHEMATICS OF OPERATIONS RESEARCH,
1983,
8
(02)
: 298
-
313
←
1
2
3
4
5
→