Piecewise linear value function approximation for factored MDPs

Cited by: 0

Authors:

Poupart, P [1]

Boutilier, C [1]

Patrascu, R [1]

Schuurmans, D [1]

Affiliations:

[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H5, Canada

Source:

EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS | 2002

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A number of proposals have been put forth in recent years for the solution of Markov decision processes (MDPs) whose state (and sometimes action) spaces are factored. One recent class of methods involves linear value function approximation, where the optimal value function is assumed to be a linear combination of some set of basis functions, with the aim of finding suitable weights. While sophisticated techniques have been developed for finding the best approximation within this constrained space, few methods have been proposed for choosing a suitable basis set, or modifying it if solution quality is found wanting. We propose a general framework, and specific proposals, that address both of these questions. In particular, we examine weakly coupled MDPs where a number of subtasks can be viewed independently modulo resource constraints. We then describe methods for constructing a piecewise linear combination of the subtask value functions, using greedy decision tree techniques. We argue that this architecture is suitable for many types of MDPs whose combinatorics are determined largely by the existence of multiple conflicting objectives.

Pages: 292 - 299

Number of pages: 8

50 items in total

[21] Discovering hidden structure in factored MDPs
Kolobov, Andrey
Mausam
Weld, Daniel S.
ARTIFICIAL INTELLIGENCE, 2012, 189 : 19 - 47
[22] An Optimized Method for Nonlinear Function Approximation Based on Multiplierless Piecewise Linear Approximation
Yu, Hongjiang
Yuan, Guoshun
Kong, Dewei
Lei, Lei
He, Yuefeng
APPLIED SCIENCES-BASEL, 2022, 12 (20):
[23] A max-piecewise-linear neural network for function approximation
Wen, Chengtao
Ma, Xiaoyan
NEUROCOMPUTING, 2008, 71 (4-6) : 843 - 852
[24] A basis-function canonical piecewise-linear approximation
Wen, Chengtao
Ma, Xiaoyan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2008, 55 (05) : 1328 - 1334
[25] Softsign Function Hardware Implementation Using Piecewise Linear Approximation
Chang, Chih-Hsiang
Zhang, En-Hui
Huang, Shih-Hsu
2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
[26] Models and Algorithms for Optimal Piecewise-Linear Function Approximation
Camponogara, Eduardo
Nazari, Luiz Fernando
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[27] Piecewise linear approximation applied to nonlinear function of a neural network
Amin, H
Curtis, KM
Hayes-Gill, BR
IEE PROCEEDINGS-CIRCUITS DEVICES AND SYSTEMS, 1997, 144 (06) : 313 - 317
[28] Efficient algorithms for function approximation with piecewise linear sigmoidal networks
Hush, DR
Horne, B
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (06) : 1129 - 1141
[29] An analysis of value function learning with piecewise linear control
Tutsoy, Onder
Brown, Martin
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2016, 28 (03) : 529 - 545
[30] Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation
Wei, Chen-Yu
Jafarnia-Jahromi, Mehdi
Luo, Haipeng
Jain, Rahul
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130