Over-parameterized Deep Nonparametric Regression for Dependent Data with Its Applications to Reinforcement Learning

Cited by: 0
Authors
Feng, Xingdong [1 ]
Jiao, Yuling [2 ]
Kang, Lican [3 ]
Zhang, Baqun [1 ]
Zhou, Fan [1 ]
Affiliations
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Computat Sci, Sch Math & Stat, Wuhan, Peoples R China
[3] Wuhan Univ, Sch Math & Stat, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China; Shanghai Rising-Star Program
Keywords
Deep reinforcement learning; Low-dimensional Riemannian manifold; Penalized regression; beta-mixing; Neural networks; Generalization error; Policy iteration; Approximation; Bounds; Convergence; Systems; Rates; Game
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline Code
0812
Abstract
In this paper, we provide statistical guarantees for over-parameterized deep nonparametric regression in the presence of dependent data. By decomposing the error, we establish non-asymptotic error bounds for the deep estimator, achieved by effectively balancing the approximation and generalization errors. We derive an approximation result for Hölder functions with constrained weights, and we bound the generalization error by the weight norm, which allows the number of neural network parameters to be much larger than the training sample size. Furthermore, we address the curse of dimensionality by assuming that the samples originate from distributions with low intrinsic dimension, which allows us to overcome the challenges posed by high-dimensional spaces. By incorporating an additional error propagation mechanism, we derive oracle inequalities for the over-parameterized deep fitted Q-iteration.
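As a schematic illustration only (the notation below is assumed for exposition and is not the paper's exact statement), the balance described in the abstract can be sketched as an excess-risk decomposition: with $\hat f_n$ the penalized network estimator, $f_0$ the Hölder target, $\mathcal{F}_{\mathrm{NN}}$ the norm-constrained network class, and $n_{\mathrm{eff}}$ an effective sample size discounting for beta-mixing dependence,

% Illustrative sketch under assumed notation, not the paper's exact bound;
% the generalization term is governed by the weight norm of the class rather
% than the parameter count, which is what permits over-parameterization.
\[
  \mathbb{E}\bigl\|\hat f_n - f_0\bigr\|_{L^2}^2
  \;\lesssim\;
  \underbrace{\inf_{f \in \mathcal{F}_{\mathrm{NN}}} \|f - f_0\|_{\infty}^2}_{\text{approximation error}}
  \;+\;
  \underbrace{\frac{\mathcal{C}(\mathcal{F}_{\mathrm{NN}})}{\sqrt{n_{\mathrm{eff}}}}}_{\text{generalization error}},
\]

where $\mathcal{C}(\mathcal{F}_{\mathrm{NN}})$ denotes a weight-norm-based complexity measure. Under the low intrinsic dimension assumption, the approximation term would typically depend on the intrinsic rather than the ambient dimension.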
Pages: 40