Over-parameterized Deep Nonparametric Regression for Dependent Data with Its Applications to Reinforcement Learning

被引:0
|
作者
Feng, Xingdong [1 ]
Jiao, Yuling [2 ]
Kang, Lican [3 ]
Zhang, Baqun [1 ]
Zhou, Fan [1 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Computat Sci, Sch Math & Stat, Wuhan, Peoples R China
[3] Wuhan Univ, Sch Math & Stat, Wuhan, Peoples R China
基金
中国国家自然科学基金; 上海市科技启明星计划;
关键词
Deep reinforcement learning; Low-dimensional Riemannian manifold; Penalized regression; beta-mixing; NEURAL-NETWORKS; GENERALIZATION ERROR; POLICY ITERATION; APPROXIMATION; BOUNDS; CONVERGENCE; SYSTEMS; RATES; GAME;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we provide statistical guarantees for over-parameterized deep nonparametric regression in the presence of dependent data. By decomposing the error, we establish non-asymptotic error bounds for deep estimation, which is achieved by effectively balancing the approximation and generalization errors. We have derived an approximation result for Holder functions with constrained weights. Additionally, the generalization error is bounded by the weight norm, allowing for a neural network parameter number that is much larger than the training sample size. Furthermore, we address the issue of the curse of dimensionality by assuming that the samples originate from distributions with low intrinsic dimensions. Under this assumption, we are able to overcome the challenges posed by high-dimensional spaces. By incorporating an additional error propagation mechanism, we derive oracle inequalities for the over-parameterized deep fitted Q-iteration.
引用
收藏
页数:40
相关论文
共 50 条
  • [21] How SGD Selects the Global Minima in Over-parameterized Learning: A Dynamical Stability Perspective
    Wu, Lei
    Ma, Chao
    Weinan, E.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [22] Review of deep reinforcement learning and its applications in military field
    Zhang M.
    Dou Y.
    Chen Z.
    Jiang J.
    Yang K.
    Ge B.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (04): : 1297 - 1308
  • [23] Metrics for Assessing Generalization of Deep Reinforcement Learning in Parameterized Environments
    Aleksandrowicz, Maciej
    Jaworek-Korjakowska, Joanna
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2024, 14 (01) : 45 - 61
  • [24] Deep Reinforcement Learning with Parameterized Action Space for Object Detection
    Wu, Zheng
    Khan, Naimul Mefraz
    Gao, Lei
    Guan, Ling
    2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 101 - 104
  • [25] Parameterized deep reinforcement learning with hybrid action space for energy efficient data center networks
    Wang, Ting
    Cheng, Kai
    Du, Xiao
    Cai, Haibin
    Wang, Yang
    COMPUTER NETWORKS, 2023, 235
  • [26] ON DEEP LEARNING AS A REMEDY FOR THE CURSE OF DIMENSIONALITY IN NONPARAMETRIC REGRESSION
    Bauer, Benedikt
    Kohler, Michael
    ANNALS OF STATISTICS, 2019, 47 (04): : 2261 - 2285
  • [27] Deep Inverse Reinforcement Learning by Logistic Regression
    Uchibe, Eiji
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT I, 2016, 9947 : 23 - 31
  • [28] Uniform convergence of estimator for nonparametric regression with dependent data
    Xiaoqin Li
    Wenzhi Yang
    Shuhe Hu
    Journal of Inequalities and Applications, 2016
  • [29] Uniform convergence of estimator for nonparametric regression with dependent data
    Li, Xiaoqin
    Yang, Wenzhi
    Hu, Shuhe
    JOURNAL OF INEQUALITIES AND APPLICATIONS, 2016,
  • [30] Nonparametric estimation of expectile regression in functional dependent data
    Almanjahie, Ibrahim M.
    Bouzebda, Salim
    Kaid, Zoulikha
    Laksaci, Ali
    JOURNAL OF NONPARAMETRIC STATISTICS, 2022, 34 (01) : 250 - 281