POMDPs in Continuous Time and Discrete Spaces

被引:0
|
作者
Alt, Bastian [1 ]
Schultheis, Matthias [1 ,2 ]
Koeppl, Heinz [1 ,2 ]
机构
[1] Tech Univ Darmstadt, Dept Elect Engn & Informat Technol, Darmstadt, Germany
[2] Tech Univ Darmstadt, Ctr Cognit Sci, Darmstadt, Germany
基金
欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space systems under partial observability. This places our work at the intersection of optimal filtering and optimal control. At the current state of research, a mathematical description for simultaneous decision making and filtering in continuous time with finite state and action spaces is still missing. In this paper, we give a mathematical description of a continuous-time partial observable Markov decision process (POMDP). By leveraging optimal filtering theory we derive a Hamilton-Jacobi-Bellman (HJB) type equation that characterizes the optimal solution. Using techniques from deep learning we approximately solve the resulting partial integro-differential equation. We present (i) an approach solving the decision problem offline by learning an approximation of the value function and (ii) an online algorithm which provides a solution in belief space using deep reinforcement learning. We show the applicability on a set of toy examples which pave the way for future methods providing solutions for high dimensional problems.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] DISCRETE TIME REPRESENTATION OF CONTINUOUS TIME ARMA PROCESSES
    Chambers, Marcus J.
    Thornton, Michael A.
    ECONOMETRIC THEORY, 2012, 28 (01) : 219 - 238
  • [42] On the theory of discrete-time signals of the discrete/continuous type
    Johnson, CD
    PROCEEDINGS OF THE 35TH SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 2003, : 113 - 121
  • [43] Can discrete time make continuous space look discrete?
    Mazzola, Claudio
    EUROPEAN JOURNAL FOR PHILOSOPHY OF SCIENCE, 2014, 4 (01) : 19 - 30
  • [44] DISCRETE VS CONTINUOUS-TIME FORMULATIONS OF DISCRETE DYNAMICS
    DADIC, I
    PISK, K
    NUOVO CIMENTO DELLA SOCIETA ITALIANA DI FISICA B-BASIC TOPICS IN PHYSICS, 1983, 73 (01): : 86 - 90
  • [45] Can discrete time make continuous space look discrete?
    Claudio Mazzola
    European Journal for Philosophy of Science, 2014, 4 : 19 - 30
  • [46] Offline RL with Discrete Proxy Representations for Generalizability in POMDPs
    Gu, Pengjie
    Cai, Xinyu
    Xing, Dong
    Wang, Xinrun
    Zhao, Mengchen
    An, Bo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [47] On some continuous and discrete equations in Banach spaces on unbounded intervals
    Kubiaczyk, I
    Majcher, P
    APPLIED MATHEMATICS AND COMPUTATION, 2003, 136 (2-3) : 463 - 473
  • [48] Collaborative Training of Gans in Continuous and Discrete Spaces for Text Generation
    Kim, Yanghoon
    Won, Seungpil
    Yoon, Seunghyun
    Jung, Kyomin
    IEEE ACCESS, 2020, 8 : 226515 - 226523
  • [49] ?ech closure spaces: A unified framework for discrete and continuous homotopy
    Rieser, Antonio
    TOPOLOGY AND ITS APPLICATIONS, 2021, 296
  • [50] Continuous phase-space methods on discrete phase spaces
    Zunkovic, Bojan
    EPL, 2015, 112 (01)