Over-parameterized Deep Nonparametric Regression for Dependent Data with Its Applications to Reinforcement Learning

Cited by: 0

Authors:

Feng, Xingdong [1]

Jiao, Yuling [2]

Kang, Lican [3]

Zhang, Baqun [1]

Zhou, Fan [1]

Affiliations:

[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China

[2] Wuhan Univ, Hubei Key Lab Computat Sci, Sch Math & Stat, Wuhan, Peoples R China

[3] Wuhan Univ, Sch Math & Stat, Wuhan, Peoples R China

Source:

JOURNAL OF MACHINE LEARNING RESEARCH | 2023 / Vol. 24

Funding:

National Natural Science Foundation of China; Shanghai Rising-Star Program;

Keywords:

Deep reinforcement learning; Low-dimensional Riemannian manifold; Penalized regression; beta-mixing; NEURAL-NETWORKS; GENERALIZATION ERROR; POLICY ITERATION; APPROXIMATION; BOUNDS; CONVERGENCE; SYSTEMS; RATES; GAME;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In this paper, we provide statistical guarantees for over-parameterized deep nonparametric regression with dependent data. Decomposing the error into approximation and generalization terms and balancing the two, we establish non-asymptotic error bounds for deep estimation. We derive an approximation result for Hölder functions using networks with constrained weights. The generalization error is bounded in terms of the weight norm, which allows the number of network parameters to be much larger than the training sample size. Furthermore, we mitigate the curse of dimensionality by assuming that the samples come from distributions with low intrinsic dimension. Finally, by incorporating an additional error propagation mechanism, we derive oracle inequalities for over-parameterized deep fitted Q-iteration.


Pages: 40
