Discounted fully probabilistic design of decision rules

被引：0

作者：

Karny, Miroslav ^{[1
]}

Molnarova, Sona ^{[1
]}

机构：

[1] Czech Acad Sci, Inst Informat Theory & Automat, Dept Adapt Syst, Vodarenskou Vezi 4, Prague 18200 8, Czech Republic

来源：

INFORMATION SCIENCES | 2025年 / 690卷

关键词：

Design principles; Kullback-Leibler's divergence; Probabilistic techniques; Discounting; Closed loop; DIVERGENCE; PREFERENCE;

D O I：

10.1016/j.ins.2024.121578

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Axiomatic fully probabilistic design (FPD) of optimal decision rules strictly extends the decision making (DM) theory represented by Markov decision processes (MDP). This means that any MDP task can be approximated by an explicitly found FPD task whereas many FPD tasks have no MDP equivalent. MDP and FPD model the closed loop - the coupling of an agent and its environment - via a joint probability density (pd) relating the involved random variables, referred to as behaviour. Unlike MDP, FPD quantifies agent's aims and constraints by an ideal pd. The ideal pd is high on the desired behaviours, small on undesired behaviours and zero on forbidden ones. FPD selects the optimal decision rules as the minimiser of Kullback-Leibler's divergence of the closed-loop-modelling pd to its ideal twin. The proximity measure choice follows from the FPD axiomatics. MDP minimises the expected total loss, which is usually the sum of discounted partial losses. The discounting reflects the decreasing importance of future losses. It also diminishes the influence of errors caused by: the imperfection of the employed environment model; roughly-expressed aims; the approximate learning and decision-rules design. The established FPD cannot currently account for these important features. The paper elaborates the missing discounted version of FPD. This non-trivial filling of the gap in FPD also employs an extension of dynamic programming, which is of an independent interest.

引用

页数：12

共 50 条

[31] DECISION TREE DESIGN USING A PROBABILISTIC MODEL
CASEY, RG
NAGY, G
IEEE TRANSACTIONS ON INFORMATION THEORY, 1984, 30 (01) : 93 - 99
[32] Rules for trajectory updating in decision based design
Vijitashwa Pandey
Zissimos P. Mourelatos
Structural and Multidisciplinary Optimization, 2014, 49 : 49 - 57
[33] Rules for trajectory updating in decision based design
Pandey, Vijitashwa
Mourelatos, Zissimos P.
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2014, 49 (01) : 49 - 57
[34] Decision support rules for profile design at intersections
Lee, Jim
Movassaghi, Kam K.
Kumat, Amit
Journal of Transportation Engineering, 1995, 121 (05): : 391 - 396
[35] Why Rules are Complex: Real-Valued Probabilistic Logic Programs are not Fully Expressive
Buchman, David
Poole, David
CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
[36] A Fully Probabilistic Decentralised Control Design for Complex Stochastic Systems
Herzallah, Randa
2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 806 - 811
[37] Generalised Fully Probabilistic Controller Design For Nonlinear Affine Systems
Zafar, Ana
Herzallah, Randa
2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2019, : 137 - 144
[38] Decentralised fully probabilistic design for stochastic networks with multiplicative noise
Zhou, Yuyang
Herzallah, Randa
Zhang, Qichun
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (08) : 1841 - 1854
[39] Fully Probabilistic Design for Knowledge Transfer in a Pair of Kalman Filters
Foley, Conor
Quinn, Anthony
IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (04) : 487 - 490
[40] Fully Probabilistic Design for Stochastic Discrete System with Multiplicative Noise
Zhou, Yuyang
Herzallah, Randa
Zafar, Ana
2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, : 940 - 945

← 1 2 3 4 5 →